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Description 

Field of the invention 

5 The invention relates to the field of genetic engeneering and provides novel DNA molecules which comprise a 

DNA sequence coding for the most abundant protein in the supernatant of cultures of Lactococcus lactis , hereinafter 
named Major Secretion Product (MSP), for its signal peptide and/or the promoter of the MSP gene. The molecules 
may also comprise multifunctional origin of replication. The novel DNA molecules are used for the production of novel 
shuttle vectors for cloning of DNA in at least E. coli and Lactococcus spec, or novel vectors for the expression of 

10 homologous or heterologous genes and the production of secreted gene products in gram positive bacteria such as 
Lactococcus spec, and Bacillus spec. 

Background of the invention 

15 Although in genetic engineering techniques numerous procaryotic vector-host -systems for cloning of heterologous 

or homologous genes are already known, there is a continuous need for novel systems which may have advantages 
over the known systems. 

Most recombinant work in DNA technology has been carried out with bacteria such as Escherichia coli or Bacillus 
subtilis. The lactic acid bacteria, however, are of much more industrial interest. For this reason, a number of efforts 

20 have been made to develop plasmid vectors for cloning and expression of homologous and heterologous genes in 
lactic acid bacteria, particularly in Lactobacillus spec, and Lactococcus spec, [see PCT WO 85/03945, Gasson and 
Anderson (1985), EP-A-0 316 677, Bates et al. (1989), Jos et al. (1985) and review articles of Chassy (1987) and of 
De Vos (1987)1. Lactococcus spec, was formerly named Streptococcus spec . 

Lactococcal strains investigated so far harbour a characteristic plasmid complement consisting of multiple different 

25 plasmids. This property can be used to differentiate between various lactococcal strains (Davies et al., 1981). For 
genetic studies, plasmid free strains have been constructed by repeated curing in the course of plasmid function studies 
(De Vos, 1987). 

For example, the plasmid complement of L. lactis 71 2, hereinafter also called L. lactis LL71 2, consists of 5 plasmids 
having the molecular weights of 1.8 Md, 2.5 Md, 5.2 Md, 9 Md and 33 Md which are named pSH71, pSH72, pSH73, 

30 pSH74 and pLP71 2, respectively (Gasson, 1 983). 

Based on plasmid pSH71 of L. lactis and on the related L. cremoris plasmid pWV01 (Otto et al., 1982) various 
cloning vectors have been constructed. The cloning vectors have been produced either by inserting genetic markers 
such as antibiotic resistance genes into the plasmids or by screening fragments of the plasmids for an origin of repli- 
cation function, i.e. for the ability to sustain replication of selected DNA fragments. A cloning vector produced according 

35 to the latter method is pNZ12 which contains the 1.7 kbp Clal restriction fragment of pSH71 comprising the origin of 
replication (Gasson and Anderson, 1985). The origin of replication of pSH71 is also functional in other gram-positive 
bacteria such as Bacillus spec, and in the gram-negative Escherichia coli. 

On the basis of these plasmids cloning vectors useful for the introduction and expression of homologous or het- 
erologous genes in lactic acid bacteria have been developed. The development of the cloning vectors resulted in 

40 transformed lactococcal strains with improved properties which are useful in food and feed industry, for example a 
bacteriophage resistant L. lactis strain (EP-A-0 316 677) or a L. lactis strain which produces bovine prochymosin (PCT 
WO 85/03945). The development of cloning vectors for the production of homologous or heterologous gene products 
is not only of interest because of the production of improved lactic acid bacteria cells but also for the production of 
recombinant proteins. One of the major problems with the production of heterologous proteins in microbial expression 

45 systems has been the purification of the product Purification of intracellular proteins is time-consuming and often results 
in poor yields. Purification can be considerably facilitated if the product is secreted from the host cell. To avoid the 
problems of purification of the products expressed in bacteria, vector-host systems for the production of recombinant 
proteins which are secreted into the supernatant can be useful. 

Another advantage of secreted proteins can be that they can have a native and biologically active conformation, 

50 because then no refolding process is needed. Refolding is usually necessary if the polypeptide is intracellular^ de- 
posited. 

Secretion of a protein usually requires a signal peptide at the amino terminus of the primary translation product 
which directs the protein into the secretory pathway. It is of advantage if the signal peptide is cleaved enzymatically 
from the protein during the translocation through the cell membrane. This, however, is not always the case. 

ss 

Object of the invention and solutions 

It is an object of the invention to provide novel hybrid vectors which can replicate in gram positive and gram negative 
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bacteria and/or which allow expression of homologous or heterologous genes and secretion of stable protein products 
into the supernatant. 

In particular, this object has been achieved by the present novel hybrid vectors comprising a novel DNA insert 
which comprises the promoter region, the DNA sequence coding for the signal peptide and the coding region of a 
5 hitherto unknown gene encoding a polypeptide which is the most abundant protein in the supernatant of L. lactis as 
judged after TCA precipitation of the supernatant, SDS-polyacrylamide electrophoresis of the precipitated proteins and 
staining of the gel with Coomassie brilliant blue and which is called herein Major Secretion Product (MSP). 

Accordingly, another solution to an object of the invention is to provide novel hybrid vectors comprising the promoter 
region, the DNA sequence coding for the signal peptide and/or the coding region of the MSP gene or of a related gene 
to and an origin of replication derived from the 2.5 Md or 5.2 Md plasmid of L. lactis LL712 or an origin of replication 
related thereto. 

The invention concerns also functional fragments of the novel DNA inserts or the origins of replication per se. They 
are useful for the production of novel expression vectors for the secretion of homologous or heterologous proteins from 
lactic acid bacteria. 

J5 The invention further provides a method for the preparation of the novel DNA molecules and hybrid vectors, and 

a method for the production of secreted gene products by means of hosts transformed with a novel hybrid vector of 
the invention. 

The invention provides also the MSP protein in pure form. 
20 Detailed description of the invention 
DNA molecules 

The invention concerns a hybrid vector comprising 

25 

a) the approximately 3.5 kbp EcoRI/Sall L. lactis insert of the plasmid pUCRS deposited under the accession 
number DSM 5803, or a functional fragment thereof, or 

b) a DNA sequence which hybridizes with said insert or with a functional fragment thereof, or comprises a promoter 
30 region which is naturally operatively linked to such a hybridizing DNA sequence, or 

c) a degenerate sequence of a DNA sequence which is covered in a) and which encodes a signal peptide, or 

d) a derivative of a DNA molecule covered in a), b) or c), 

35 

The vector may also comprise the origin of replication of (I) the 2.5 Md plasmid of L lactis LL7 12 deposited under 
the accession number DSM 5804, or of (II) the 5.2 Md plasmid of L. lactis LL712, or of (III) a plasmid of the same 
incompatibility group as the 2.5 Md plasmid or the 5.2 Md plasmid of L. lactis LL712. 

The approximately 3.5 kbp EcoRI/Sall L lactis insert of the plasmid pUCRS comprises the promoter region of the 
40 MSP gene, the DNA sequence encoding the MSP signal peptide and the coding region for MSP. MSP with its signal 
peptide attached is hereinafter named pre-MSP. The EcoRI/Sall insert comprises the 1920 bp long DNA sequence 
with the SEQ ID No. 1 depicted in the sequence listing in such orientation that base position 1 is proximal to the EcoRI 
site. 

The coding region for MSP extends from base position 492 up to base position 1793 of the sequence with SEQ 
45 ID No. 1. 

The DNA sequence which encodes the 27 amino acid long MSP signal peptide, extends from the end of the 
promoter region defined hereinafter up to the beginning of the DNA sequence coding for the mature MSR i.e. from 
base position 411 up to position 491 of the sequence with SEQ ID No. 1. The MSP signal peptide or a functional 
fragment thereof causes a protein to be excreted from a host cell, e.g. from a lactic acid bacterium, such as Lactococcus 
so lactis, or from Bacillus spec , such as B. thurinqiensis , if the N-terminus of the protein is covalently linked to the C- 
terminus of the signal peptide. Covalent linkage of the C-terminus of the MSP signal peptide and the N-terminus of a 
protein which is to be excreted from a host cell can be obtained by the production of a fusion gene in which the entire 
DNA sequence encoding the MSP signal peptide or part thereof encoding a functional fragment of the MSP signal 
peptide comprising the C-terminus is linked in proper reading frame to the 5' end of a structural gene encoding the 
55 desired protein. Such a fusion gene is, for example, the DNA molecule with SEQ No. 2 which comprises the DNA 
sequence encoding the MSP signal peptide and the hirudin structural gene. 

The amino acid sequences of the MSP signal peptide and of MSP are also given in the sequence with SEQ ID No. 1 . 

The promoter region of the MSP gene is located upstream of the DNA sequence encoding the MSP signal peptide 
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and comprises up to about 1500, preferably up to about 100 to 1000 nucleotides. In the approximately 3.5 kbp long 
EcoRI/Sall L. lactis insert of the plasmid pUCRS, the promoter region extends from the EcoRI cut end of the insert 
located upstream of the base corresponding to position 1 of the DNA sequence with SEQ ID No. 1 up to base position 
411. More particular, the promoter region extends from the first Hindlll restriction site located upstream of the base 
5 corresponding to position 1 up to base position 41 1 . The promoter region can bind RNA polymerase as well as regulatory 
proteins and can control the expression of a structural gene operatively linked therewith. 

The term functional fragment includes those DNA fragments retaining promoter, signal and/or structural functions. 

Preferred functional fragments are those containig the MSP promoter region, the DNA sequence encoding the 
MSP signal peptide or the promoter region and the DNA sequence encoding the MSP signal peptide. Fragments 
10 according to the invention are also composed of smaller fragments retaining the MSP promoter activity and/or coding 
for a peptide with signal peptide activity. 

Fragments with promoter function, for example, are those starting with the first base at the EcoRI cut end of the 
approximately 3.5 kbp EcoRI/Sall L. lactis insert or, in particular those starting at the Hindlll site and extending up to 
a base corresponding to about position 410 of the sequence with SEQ ID No. 1 . Other fragments with promoter function 
15 are selected from the group of fragments starting with any one of the bases between the EcoRI and the Hindlll site 
and ending with a base in about position 410 of the sequence with SEQ ID No. 1 . Shorter fragments of the promoter 
region also retain promoter activity. 

A DNA fragment which encodes a signal peptide extends, for example, from about base 411 up to base 491 of the 
sequence with SEQ ID No. 1 . It can be extended at the 5' end with a fragment of the promoter region which does not 
20 retain promoter activity. 

A fragment retaining MSP promoter function and encoding a signal peptide extends from the EcoRI cut end of the 
insert located upstream of the base corresponding to position 1 of the DNA sequence with SEQ ID No. No. 1 down to 
base position 491 . Another fragment retaining MSP promoter function and encoding a signal peptide extends from the 
first Hindlll restriction site located upstream of the base corresponding to position 1 of DNA sequence with SEQ ID 
25 No. 1 up to about the base corresponding to position 491. Other fragments retaining MSP promoter function and 
encoding a signal peptide are selected from the group of fragments starting with anyone of the bases between the said 
EcoRI and Hindlll sites and ending with a base in about position 491 of the sequence with SEQ ID No. 1 . 

The fragments may contain linkers which provide for succesful linkage to other DNA molecules. Suitable linkers 
to above fragments have a DNA sequence which fits into the restriction site of the DNA to which the fragment is to be 
30 linked. They may also contain a predetermined restriction site. 

A DNA molecule which hybridizes with said insert is hybridizing under conventional conditions. A conventional 
hybridization procedure is described e.g. by Benton and Davis (1977). Such a hybridizing DNA molecule comprises, 
for example, as hybridizing DNA sequence a variant of the structural gene encoding MSP. Such a hybridizing variant 
is hereinafter also referred to as a structural gene related to the MSP gene. The protein encoded by such a related 
35 gene is referred to as a protein related to MSP Accordingly, a promoter region which is naturally operatively linked to 
a DNA sequence hybridizing with said insert is a promoter region of a gene related to the MSP gene. 

A structural gene related to the MSP gene is, for example, a naturally occuring variant derived from another bac- 
terium than L. lactis , particularly from another lactic acid bacterium, for example from Lactobacillus spec, or Lactococcus 
spec , e.g. L. cremoris or L thermophilus . It is also a naturally occuring variant which is encoded by an isogene on the 
40 chromosome of or on a plasmid naturally occuring in L lactis . 

A DNA molecule comprising a gene related to the MSP gene can be isolated according to conventional methods. 

DNA molecules of the invention are also such having degenerated DNA sequences. They are degenerated within 
the meaning of the genetic code in that an unlimited number of nucleotides are replaced by other nucleotides without 
changing the amino acid sequence for which they code. Molecules having such degenerate DNA sequences may be 
45 useful because of their different restriction sites or because of a preferred codon usage in a particular host. Preferred 
are degenerated sequences of the DNA sequence coding for the MSP signal peptide or of a variant thereof. 

The term derivative when used in connection with a DNA molecule covered in a), b) or c) includes fragments, 
mutants or larger derivatives of such a DNA molecule. The term derivative includes also larger derivatives of fragments 
or mutants of such a DNA molecule. Preferred are fragments retaining promoter, signal or structural functions. Exam- 
50 pies of such fragments are given above. 

A mutant of a DNA molecule described in a), b) or c) is, for example, a DNA molecule having a deletion, insertion, 
inversion, or point mutation which may occur naturally or may be artificially introduced into the DNA molecule in vivo 
or in vitro according to conventional methods. A mutant can show an altered restriction pattern. 

Larger derivatives of a DNA molecule covered in a), b) or c) are, for example, excisable from the L. lactis genome. 
55 They can be found in a genomic library of L lactis LM0230 obtained by fragmentation of the nucleic acids, treatment 
of the fragments with a suitable restriction enzyme, e.g. Sau3AI, EcoRI, BamHI or Hindlll, ligating into a suitable vector, 
e.g. the lambda phage A.EMBL3 or the plasmid pBR322, cloning, e.g. in E. coli, and excising again, with the same or 
another suitable restriction enzyme. 
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Larger derivatives of a DNA molecule covered in a), b) or c) are also recombinant DNA molecules with flanking 
sequences, for example such comprising linkers which provide suitable restriction sites or which put regulatory se- 
quences, such as a promoter, a DNA sequence encoding a signal peptide or a terminator, and a structural gene into 
the correct distance or reading frame, or such comprising sequences derived from a vector, e.g. from a phage or 
5 plasmid used in the construction of a hybrid vector. The flanking sequences comprised in the larger derivatives may 
give rise to fusion genes. 

An origin of replication derived from the L. lactis LL712 2.5 Md or 5.2 Md plasmid or from a plasmid of the same 
incompatibility group as the 2.5 Md plasmid or the 5.2 Md plasmid can cause a plasmid to replicate in acid bacteria, 
for example in lactococci, e.g. in L lactis L lactis diacetylactis or L. cremoris, and in E. coli or also in gram negative 
10 bacteria other than E. coli or in gram positive bacteria other than lactic acid bacteria, e.g. in Bacillus spec, such as B. 
thurinqiensis. 

An origin of replication which is derived from the L. lactis LL712 2.5 Md or 5.2 Md plasmid or from a plasmid of the 
same incompatibility group as the 2.5 Md plasmid or the 5.2 Md plasmid is, for example, comprised in the whole 
respective linearized plasmid obtainable after treatment with a suitable restriction endonuclease. Accordingly, a hybrid 
'5 vector of the invention is also such comprising the whole DNA sequence of the 2.5 Md or the 5.2 Md plasmid or of a 
plasmid of the same incompatibility group as the 2.5 Md or 5.2 Md plasmid. Both plasmids can be isolated, for example, 
from the strain L. lactis LL712 which is deposited with the DSM as DSM 5804. 

A DNA sequence which functions as an origin of replication is also a fragment of any of said plasmids retaining 
origin of replication function. Such a DNA fragment can be obtained, for example, by isolating it after the fragmentation 
20 of the respective plasmid or of a fragment thereof, for example, with physical forces, e.g. sharing forces, or with chemical 
cleavage reactions, or with enzymes cleaving DNA such as nucleases, e.g. the exonucleases Bal31 , S1 or exonuclease 
III, or endonucleases, e.g. restriction endonucleases. In particular, such a DNA fragment is a restriction fragment ob- 
tained after treatment of the respective plasmid or fragment therof with one or with two different restriction endonucle- 
ases which are in the case of the 2.5 Md plasmid, for example, Ndel, Sphl and/or EcoRV and in the case of the 5.2 
25 Md plasmid, for example, Sau3A, Ndel, Hindlll, Accl and/or EcoRI. 

The origin of replication of the 2.5 Md plasmid is located, for example, on the approximately 1.8 kbp long Ndel/ 
Sphl fragment : the approximately 3.1 kbp long EcoRV/Sphl fragment obtainable after partial EcoRV digestion of the 
approximately 3.5 kbp long Sphl fragment, the approximately 3.5 kbp long Sphl fragment, the approximately 1.2 kbp 
long Ndel/EcoRV fragment, the approximately 2.5 kbp long EcoRV fragment, or the approximately 3.5 kbp long EcoRV/ 
30 Sphl fragment obtainable after partial EcoRV digestion of the approximately 3.5 kbp long Sphl fragment. The approx- 
imately 3.5 kbp long Sphl and the other fragments mentioned hereinbefore are also comprised in pSC1 2. The production 
of pSC12 is described in Example 1.2. A restriction map is given in Figure 2. 

The origin of replication of the 5.2 Md plasmid is located, for example, on the approximately 1 .0 kbp long Ndel/ 
Hindlll fragment, the approximately 2.2 kbp long Ndel/Accl fragment, the approximately 2.7 kbp long Ndel/EcoRI trag- 
us ment, the approximately 3.1 kbp long NdeVSau3A fragment, the approximately 2.0 kbp long Sau3A/Hindlll fragment, 
the approximately 3.2 kbp long Sau3A/Accl fragment, the approximately 3.7 kbp long Sau3A/EcoRI fragment, or the 
approximately 4.1 kbp long Sau3A fragment. The fragments with Sau3A cut ends are obtainable after partial Sau3A 
digestion of the 5.2 Md plasmid, of pSC18 or of suitable restriction fragments of these plasmids. The production of 
pSC18 is described in Example 1.3. A restriction map is given in Figure 3. 
40 Plasmids of the same incompatibility group cannot stably coexist in the same cell. They carry related origins of 

replication. 

A recombinant DNA molecule does, for example, contain linkers and/or sequences derived from a vector such as 
from a phage or plasmid, and optionally a marker gene such as a resistance marker gene, e.g. an erythromycin, 
ampicillin, or tetracyclin resistance marker gene or the like. A recombinant DNA molecule comprising an origin of 
45 replication of the present invention is, for example, a vector plasmid for lactic acid bacteria, preferentially a shuttle 
vector which can replicate at least in lactic acid bacteria and E. coli and wherein a lactococcal derived origin of the 
invention is the only origin of replication. 

A preferred shuttle vector is pSC 12, pSC12AP, pSC12AN, pSC12ANP, pSC18, pSCl8AP, orpSC18AN. Apreferred 
shuttle vector for the expression of a foreign gene in lactic acid bacteria ispSC12HIR, pSC12HIR-1 or pSC12HIRTerm. 
so The preferred shuttle vectors are described in the Examples. 

A DNA sequence which functions as an origin of replication in at least lactic acid bacteria and E. coli which is 
intended to be covered above under e) is also an inversion, insertion, deletion or point mutant of a DNA sequence 
defined hereinbefore which retains the function of an origin of replication. Such mutants have an altered nucleotide 
sequence in the DNA sequences flanking the real origin of replication, for example point mutations which cause an 
55 altered pattern of restriction sites or deletion mutants such as the L. lactis inserts of the plasmids pSC12AP, pSC12AN, 
pSC12ANP, pSC18ANP, or pSC18AN (see Table 1). The real origin of replication mentioned hereinbefore is intended 
to be the smallest DNA fragment which functions as an origin of replication. 

The hybrid vectors of the invention are usable for cloning in hosts such as fungi or in particular bacteria. They can 
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be derived from any vector useful in the art of genetic engineering, such as from phages, cosmids or plasmids. They 
are, for example, derivatives of phage X, e.g. NM 989 or EMBL 3, of phage Mi 3, e.g. M13mp18 or M13mp19, of 
bacterial plasmids, e.g. pBR 322, pUC18 or pUC19 or plasmids of lactic acid bacteria, or of yeast plasmids, e.g. yeast 
2\i plasmid, or also of defective phages or defective plasmids in the presence of a helper phage or a helper plasmid 

5 allowing replication of said defective phages or plasmids. 

A hybrid vector of the invention comprises a nucleotide sequence of a DNA molecule covered in a), b), c) or d). It 
comprises in particular the approximately 3.5 kbp EcoRI/Sall L. lactis insert, a fragment thereof coding for the MSP 
signal peptide and/or the promoter of the MSP gene. Depending on the type of the DNA molecule of the invention 
inserted in the hybrid vector, the vector may comprise a hybrid expression control sequence. Such a hybrid expression 

10 control sequence partially consists of a DNA molecule covered in a), b) or c) and partially of a DNA molecule different 
from a DNA molecule covered in a), b) or c). For example, the hybrid vector can comprise the MSP promoter in com- 
bination with a DNA sequence coding for a non-MSP signal peptide a DNA fragment coding for the MSP signal peptide 
in combination with another than the MSP promoter. Additionally, a hybrid vector of the invention optionally comprises 
a transcriptional terminator region, e.g. the trpA transcription terminator of E. coli. 

f5 Hybrid vectors comprising a DNA sequence coding for a signal peptide of the present invention, i.e. derived from 

the gene for pre-MSP or a related gene, are useful for the production of secreted gene products in lactic acid bacteria : 
particularly in Lactococcus spec , especially in L lactis or L. cremoris or in other bacteria, for example a Bacillus spec. 
such as B. thuringiensis . Such a hybrid vector comprises a promoter which functions in the desired bacterium, for 
example the MSP promoter for the expression in lactic acid bacteria, and optionally a transcription terminator region 

20 which functions in the desired bacterium. The use of a terminator can increase the yield of the recombinant gene 
product if it is operatively linked with the respective homologous or heterologous structural gene. For example, if the 
trpA terminator of E. coli is operatively linked with a hirudin gene (EP-168 342) which is expressed under the control 
of the MSP promoter and a DNA molecule encoding the MSP signal peptide in lactic acid bacteria, the production of 
desulfatohirudin (vgl. EP-142 860) is markedly increased. 

2S a hybrid vector of the invention also comprises an origin of replication which functions in the desired host, e.g. in 

lactic acid bacteria and/or Bacillus spec, and/or E. coli. 

A DNA sequence bearing an origin of replication which functions in at least lactic acid bacteria is, for example, the 
1.7 kbp Clal fragment of the lactococcal plasmid pSH71 (Gasson and Anderson, 1985) or one of the DNA fragments 
defined hereinbefore which comprise the origin of replication of the 2.5 Md plasmid or of the 5.2 Md plasmid of L. lactis 

30 LL712. 

A hybrid vector of the present invention may contain a selectable marker. The choice of the marker depends on 
the host which is to be transformed, selected and cloned. Any marker gene can be used which facilitates the selection 
of transformants due to the phenotypic expression of the marker. Suitable markers are particularly those expressing 
antibiotic resistance, e.g. against erythromycin, tetracycline or ampicillin, or in the case of auxotrophic mutants, genes 

35 which complement host lesions, e.g. genes of the lactose metabolizing pathway. 

Preferred embodiments of the present invention are hybrid vectors comprising a homologous or heterologous 
structural gene : for example the MSP or especially a heterologous structural gene defined hereinafter, which is oper- 
atively linked in a proper reading frame to the DNA sequence encoding the MSP signal peptide and which can be 
expressed under the control of the MSP promoter or a heterologous promoter. 

40 Structural genes may originate from viruses, procaryotic cells or eucaryotic cells and may be derived from genomic 

DNA or from cDNA prepared via the mRNA route, or may be synthesized chemically. They may code for a wide variety 
of useful polypeptides, including glycosylated polypeptides, in particular of higher eukaryotic, especially mammalian, 
such as animal or especially human origin, such as enzymes which can be used, for example, for the production of 
nutrients and for performing enzymatic reactions in chemistry, or polypeptides which are useful and valuable for the 

45 treatment of human or animal diseases or for the prevention thereof, for example hormones, polypeptides with immu- 
nomodulatory, anti-viral and anti-tumor properties, antibodies, viral antigens, vaccines, clotting factors, foodstuffs, and 
the like. 

Examples of such heterologous structural genes are e.g. those coding for hormones such as secretin, thymosin, 
relaxin, calcitonin, luteinizing hormone, parathyroid hormone, adrenocorticotropin, melanocyte-stimulating hormone, 

50 p-lipotropin, urogastrone or insulin, growth factors, such as epidermal growth factor, insulin-like growth factor (IGF), e. 
g. IGF-I and IGF-II, mast cell growth factor, nerve growth factor, glia derived nerve cell growth factor, or transforming 
growth factor (TGF), such as TGFp, growth hormones, such as human or bovine growth hormones, interleukin, such 
as interleukin-1 or -2, human macrophage migration inhibitory factor (MIF), interferons, such as human a-interferon, 
for example interferon-aA, aB, aD or otF, p-interferon, y-interferon or a hybrid interferon, for example an aA-aD- or an 

55 aB-ccD-hybrid interferon, especially the hybrid interferon BDBB, proteinase inhibitors such as a, -antitrypsin, SLPI and 
the like, hepatitis virus antigens, such as hepatitis B virus surface or core antigen or hepatitis A virus antigen, or hepatitis 
non A-nonB antigen, plasminogen activators, such as tissue plasminogen activator or urokinase, tumour necrosis factor, 
somatostatin, renin, p-endorphin, immunoglobulins, such as the light and/or heavy chains of immunoglobulin D, E or 
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G, or human-mousG hybrid immunoglobulins, immunoglobulin binding factors, such as immunoglobulin E binding factor, 
e.g. sCD23, calcitonin, human calcitonin-related peptide, blood clotting factors, such as factor IX or Vlllc, erythropoietin, 
eglin, such as eglin C, hirudin, desulfatohirudin, such as desulfatohirudin variant HV1, HV2, HV3, [Leu 1 ,Thr 2 ]-HV1, 
[Lys 47 ]-HV2 or PA, human superoxide dismutase, viral thymidin kinase, p-lactamase, glucose isomerase. A preferred 

s gene is this coding for desulfatohirudin, e.g. variant HV1. In the hybrid vectors of the present invention, the present 
promoter and/or the DNA molecule encoding the MSP signal peptide is operably linked to the polypeptide coding region 
so as to ensure effective expression of the polypeptide. 

Preferred hybrid vectors are pUCRS, M13mp18RS, M13mp19H, pVAHIR, pVAHIR-1, pSC12HIR, pSC12HIR-1, 
and in particular pSC12HIRTerm. The hybrid vectors are described hereinafter in the Examples. 

10 The invention also concerns a DNA molecule which 

a) is the approximately 3.5 kbp EcoRI/Sall L lactis insert of the plasmid pUCRS deposited under the accession 
number DSM 5803, or a functional fragment thereof, or 

is b) hybridizes with said insert or with a functional fragment thereof, or comprises a promoter region which is naturally 

operatively linked to such a hybridizing DNA sequence, or 

c) is a degenerate sequence of a DNA sequence which is covered in a) and which encodes a signal peptide, or 

20 d) is a derivative of a DNA molecule covered in a), b) or c), per se. 

The molecule may also comprise the origin of replication of (I) the 2.5 Md plasmid of L lactis LL712 deposited 
under the accession number DSM 5804, or of (II) the 5.2 Md plasmid of L. lactis LL71 2, or of (III) a plasmid of the same 
incompatibility group as the 2.5 Md plasmid or the 5.2 Md plasmid of L. lactis LL712. 
25 These DNA molecules are useful for the preparation of the hybrid vectors of the invention or for screening DNA 

gene libraries or mRNA for further similar DNAs or mRNAs. 

Process for the Preparation of the DNA Molecules and of MSP 

30 a further object of the invention is a process for the preparation of a DNA molecule of the invention, i.e. of a hybrid 

vector or a DNA molecule defined hereinbefore, comprising 

A) culturing a host which comprises a DNA molecule of the invention and isolating a DNA molecule of the invention 
from such a cultured host, or 
35 B) preparing a DNA molecule of the invention by an jn vitro synthesis. 

The culturing of the hosts is carried out in a conventional nutrient medium which may be supplemented with or 
deprived of chemical compounds allowing negative or positive selection of the transformants, i.e. such hosts containing 
the desired DNA molecule together with a selection marker, from the non-transformants, i.e. such hosts lacking the 
40 desired DNA molecule. 

Any suitable transformable hosts useful in the art may be used, for example suitable bacteria, such as gram- 
negative bacteria, e.g. E. coli, or gram-positive bacteria, e.g. Bacillus spec, or lactic acid bacteria, such as Lactobacillus 
spec , or especially Lactococcus spec , e.g. L lactis L lactis diacetylactis or L. cremoris. 

Bacteria are transformed by a conventional method and the transformants are identified in conventional manner, 
45 e.g. by their resistance, for example against tetracycline. 

In particular the described hybrid vectors are propagated in suitable E. coli host strains, such as TG1, HB101 
(Maniatis et al (1982)), JM109, MH1 and the like, or in suitable Bacillus strains, or in suitable lactic acid bacteria, e.g. 
Lactococcus strains such as L lactis 0230, L lactis diacetylactis, L. cremoris and the like. The hosts are transformed 
and selected by methods conventional in the art. The propagated plasmid DNA is isolated from the bacteria by con- 
so ventional methods, for example as described by Birnboim & Doly (1 979). 

A DNA molecule of the invention can also be prepared by an in vitro synthesis according to conventional methods. 
The in vitro synthesis is especially applicable for the preparation of smaller fragments, e.g. of the DNA sequences of 
the MSP gene or of a related gene coding for the promoter or in particular for the signal peptide. 

A DNA molecule of the invention can be obtained from a lactic acid bacterium containing such a DNA molecule, 
55 in particular from a genomic library thereof or also via mRNA. 

In the following the preparation of a DNA molecule which is the approximately 3.5 kbp EcoRI/Sall L. lactis insert 
of plasmid pUCRS is described in more detail. 

As starting material a genomic library of Lactococcus spec, e.g. L lactis 0230, can be used which is prepared 
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according to conventional methods, e.g. by partial digestion of genomic DNA of a L. lactis strain, e.g. LM 0230, C2 or 
LL712, with a restriction enzyme, e.g. Sau3A or Mbol, and cloning the high molecular weight DNA fragments in a 
suitable vector, e.g. the E. coli plasmid pUN121 or a lambda phage, e.g. A.EMBL3. 

Any other strain of lactic acid bacteria producing MSP may also serve as source for the genomic library and likewise 
5 other suitable vectors may be used as recipient for the fragments. 

In order to successfully screen the genomic library for DNA sequences of the present invention a DNA probe is 
necessary which hybridizes with such a DNA sequence, e.g. with the MSP structural gene. This can be a synthetic 
DNA probe if the sequence of e.g. the MSP gene or part thereof is known. As neither the MSP protein nor the MSP 
gene sequence or part thereof was known prior to the invention, the problems of purification of MSP, of the determination 
10 of the N-terminus sequence of MSP and of the preparation of hybridizing DNA probes were solved first. 

MSP is the most abundant gene product in the supernatant of cultures of Lactococcus spec , e g L lactis as judged 
after TCA precipitation of the supernatant, SDS-polyacrylamide gel electrophoresis of the precipitated proteins and 
staining of the gel with Coomassie brilliant blue. 

For the purification of MSP any source containing it may be used, for example, the supernatant of a culture of 
is lactic acid bacteria, e.g. Lactococcus spec such as L. lactis . MSP can be purified from a SDS polyacrylamide gel by 
cutting out the gel piece comprising the major protein band and by eluting MSP. 

Further purification methods such as precipitation with acid, e.g. trichloroacetic acid, salting out, desalting, repre- 
cipitation in form of a different salt, dialysis chromatography, e.g. affinity chromatography, ion exchange chromatogra- 
phy, gel permeation chromatography, electrophoresis, e.g. with SDS-polyacrylamide gel, isoelectric focusing, electro- 
20 elution, and the like, or any combination thereof can also be applied in order to obtain pure MSP. 

In the present invention, a MSP with an apparent molecular weight of about 56 kD was isolated in pure form by 
precipitating the proteins from the supernatant of L. lactis LM0230 with trichloroacetic acid, by a subsequent electro- 
phoresis of the precipitated protein on a SDS polyacrylamide gel, cutting out the region with the major protein band 
and electroeluting MSP from the gel. This method is also suitable for the isolation of proteins related to MSP having 
25 another apparent molecular weight. The amino acid sequence of MSP was partly determined by sequencing and partly 
deduced from the DNA sequence. It is shown in the sequence listing under SEQ ID No. 1 and extends from position 
28 up to 466 of the amino acid sequence depicted. 

The pure MSP protein is also a subject of the invention. 

The sequencing of the N-terminus of MSP was performed in conventional manner and revealed the following amino 
30 acid sequence: 

X-X-Asn-Ser-Asp-lle-Ala-Lys-Gln-Asp-Ala-Thr-lle-Ser-X-Ala-Gln-Ser-Ala-Lys-Ala-Gln-Ala-Gln-Ala-Gln-Val^ The 

first two amino acids and the amino acid in position 15 which are indicated with "X" have not been determined. 

Based on the sequence of the amino acids in position 5 to 1 3 the following oligonucleotide mixture was synthesized: 

GAN 1 ATN 2 GCI AAN 3 CAN 3 GA^ GCI AC. In this nucleotide sequence, is T or C; N 2 is T, C or A; N 3 is A or G. 
3S A represents a nucleotide with the base Adenine, T with Thymine, C with Cytidine, G with Guanosine and I with Inosine. 

The oligonucleotide mixture was radioactively marked in conventional manner and used to screen a genomic library 

of Lactococcus spec , particularly L lactis LM0230. 

DNA probes containing sequences encoding MSP amino acid sequences and having at least about 14 bp can be 

used for screening for nucleic acids comprising related MSP genes or part thereof, which includes also the screening 
40 for pre-MSP 

For screening purposes the DNA probes are radioactively labelled at their 5' end by methods known in the art 
using y^P-ATP and T4 kinase. Host microorganisms or phages carrying nucleic acids of the present invention as an 
insert, are identified by hybridization with the labelled DNA probe on filter replicas of the gene library. 

The hybridization conditions used are conventional and may be more or less stringent, e.g. simply by choosing 
45 different temperatures. 

The hybridizing part of a DNA clone of the library which hybridizes with the oligo-nucleotide mixture was partly 
sequenced according to conventional methods. The determined sequence comprises nearly the entire functional MSP 
gene. The sequence is depicted in the sequence listing under SEQ ID No. 1. 

The 3.5 kbp EcoRI/Sall L. iactis insert can be isolated from positive clones of a genomic library by digestion with 
50 EcoRI and Sail and subsequent purification of the insert according to conventional methods, e.g. using agarose gel 
electrophoresis. Fragments can be prepared according to conventional methods and can be ligated into a suitable 
vector, e.g. into M13mp18 to generate M13mp18RS or into pUC18 to generate pUCRS. 

Likewise, any other L. lactis insert comprising MSP sequences, for example larger derivatives, variants or frag- 
ments of the 3.5 kbp EcoRI/Sall insert, or DNA molecules which hybridize with the 3.5 kbp EcoRI/Sall insert or comprise 
55 a promoter region which is naturally linked to such a hybridizing DNA molecule can be isolated from a genomic library 
of lactic acid bacteria. 

Fragments of the DNA molecule according to a) to c) may be obtained in conventional manner, e.g. by isolation 
of the fragments after digestion of the insert with suitable exo- or endonucl eases, e.g. with exonuclease III, Bal31 or 
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S1 or restriction endonucleases, such as Sau3A, Hindlll, and the like. Fragments may also be obtained by in vitro DNA 

synthesis according to conventional methods. 

Mutants of the DNA molecule covered in either of a) to c) hereinbefore, for example inversion, deletion, insertion 

or point mutants, can be prepared according to conventional methods, for example in vivo or in vitro by site-directed 
s mutagenesis (see review article of Zoller and Smith 1983, Botstein and Shortle, 1985, or Norris et al., 1983) using 

mutagenic oligonucleotide primers or by deleting DNA fragments between two restriction sites by cutting with suitable 

restriction enzymes and religating the DNA, optionally in diluted solution. 

In the following the preparation of a recombinant DNA molecule according to the invention comprising a DNA 

sequence which functions as an origin of replication is described in more detail. 
10 Plasmids are isolated from L. lactis LL7 12 according to conventional methods, e.g. as described by Birnboim and 

Doly (1979) with the modifications described in the examples, and are separated in conventional manner, e.g. using 

chromatography techniques, for example agarose gel chromatography as described in Maniatis et al. (1982). The 2.5 

Md plasmid and the 5.2 Md plasmid are isolated and fragmented according to conventional methods. 

The fragment mixtures thus obtained are ligated with a suitable vector according to conventional methods, e.g. as 
15 described in Maniatis et al. (1 982) and these ligation mixtures are used to transform in conventional manner a suitable 

intermediate host strain. 

A suitable vector carries a marker gene for selection in lactic acid bacteria, e.g. a resistance marker gene, for 
example the erythromycin resistance gene, and an origin of replication which does not function in lactic acid bacteria 
but in a bacterium suitable as an intermediate host for cloning the fragments of the L lactis plasmids. A suitable vector 

20 comprises also a marker gene for selection in the intermediate host which may be identical with the marker gene which 
functions in the lactic acid bacterium. A suitable intermediate host is, for example, an E. coli strain, e.g. E. col] TG1, 
and a suitable cloning vector then is an E. coli vector, for example a pUC1 8 derivative carrying an erythromycin resist- 
ance gene, such as pUC383, the construction of which is described hereinafter in the Examples. 

The intermediate host cells which are transformed with a vector comprising fragments of the L. lactis 2.5 Md or 

25 5.2 Md plasmid optionally may be selected in conventional manner which depends on the type of intermediate host 
cell and the vector used. Selection markers may be, for example, resistance markers or genes encoding a screenable 
marker enzyme, e.g. the product of the E. coli lacZ gene, p-galactosidase, provided that the host is an E. coli strain 
defective in the genomic lacZ gene. The selection marker genes may be disturbed by the insertion of a DNA fragment. 
If the marker is lacZ, the host cells which carry a vector with a fragment inserted into the lacZ gene are no more able 

30 to convert X-Gal into a blue dye and, as a consequence, positive clones on X-Gal comprising agar plates remain white 
after induction of the expression of the lacZ gene with IPTG. 

Vectors carrying an insert derived from the L. lactis 2.5 Md or 5.2 Md plasmid are tested for their ability to replicate 
in a plasmid free L. lactis strain, e.g. in L. lactis 0230. For this purpose the plasmids are isolated from the intermediate 
host and transformed into L. lactis 0230 cells according to conventional methods, e.g. as described in Powell et al. 

35 (1988). Replicating vectors comprise a DNA insert derived from the L. lactis 5.2 Md or 2.5 Md plasmid which functions 
as an origin of replication. A DNA molecule comprising this function may be isolated from the replicting vectors, frag- 
mentated, mutated and the like and can be used to construct recombinant DNA molecules according to the invention, 
e.g. cloning or expression vectors replicating in lactic acid bacteria. The L lactis 2.5 Md or 5.2 Md plasmid derived 
origins of replication are also functional in bacteria other than lactic acid bacteria, e.g. in Bacillus spec, or E. coli . 

40 Therefore, vectors comprising such DNA fragments can be used as shuttle vectors. 

Fragments of the 2.5 Md or 5.2 Md plasmid carrying an origin of replication can also be identified and isolated 
according to the method described hereinbefore from a fragment mixture which was obtained by fragmentation of the 
whole plasmid pool of L lactis LL712. Which of the cloned fragments carrying an origin of replication is derived from 
the 2.5 Md or 5.2 Md plasmid may be determined by hybridizing the cloned fragments with the plasmids of the plasmid 

45 pool derived from L lactis LL71 2 which were separated on an agarose gel or by comparison of the restriction pattern 
of the cloned fragments and of the plasmids of the pool. 

Recombinant DNA molecules according to the invention which comprise the whole DNA sequence of the L. lactis 
2.5 Md or 5.2 Md plasmid or a plasmid of the same imcompatibility group as the 2.5 Md or 5.2 Md plasmid may be 
obtained, for example, by cutting the respective plasmid with a suitable restriction endonuclease and ligating it, for 

50 example, with a DNA fragment comprising a homologous or heterologous structural gene, a promoter region or vector 
sequences or with a linker fragment or the like. Suitable restriction endonucleases have only one recognition and 
cleavage site in the respective plasmid. 

A recombinant DNA molecule comprising a sequence with origin of replication function derived from the 2.5 Md 
or the 5.2 Md plasmid can be obtained, for example, by a method comprising the preparation of the plasmid pool of L 

55 lactis LL712 according to conventional methods, the identification of the 2.5 Md or 5.2 Md plasmid, e.g. by estimating 
the molecular weight in agarose gel electrophoresis, fragmentation of the respective plasmid, e.g. with suitable restric- 
tion enzymes, preparation of a fragment comprising the origin of replication function, e.g. one of the fragments men- 
tioned hereinbefore, and ligating such a fragment or a mixture comprising such fragment with a DNA molecule com- 
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prising no origin of replication, e.g. a cloning vector depleted of its origin of replication, and selecting for DNA molecules 
replicating in at least L. lactis and E. coll. 

Plasmids comprising an origin of replication of the same incompatibility group as the 2.5 Md or 5.2 Md plasmid 
can be identified because they cannot be maintained together with the 2.5 Md or 5.2 Md plasmid, respectively, within 
5 the same cell. 

The invention concerns also the use of a DNA molecule or a recombinant DNA molecule of the invention for the 
preparation of hybrid vectors for the expression of a structural gene. Examples of such structural genes are given 
hereinbefore. The hybrid vectors can be prepared according to conventional methods using enzymes such as restriction 
enzymes, DNA polymerases, DNA ligases and the like. 
10 Example 4 and 5 describe exemplary in more detail the preparation of hybrid vectors for the expression of a 

structural gene in lactic acid bacteria, particularly in L lactis, or in Bacillus spec , particularly B. thuringiensis. 

Transformed hosts and process for the preparation thereof 

75 The invention concerns further a bacterial host transformed with a hybrid vector of the invention. 

Transformed bacterial hosts according to the invention are suitable for the cloning, amplification and/or preparation 
of a hybrid vector comprising a DNA molecule defined in of a) to e). The hybrid vectors can replicate in such hosts and 
are not lost under selective pressure in the cell population during proliferation. The host which can be used depends 
on the origin of replication comprised in the hybrid vector In the case that the hybrid vector comprises a DNA sequence 

20 with origin of replication function according to e), a suitable host is, for example, any strain of E col], Bacillus spec, or 
Lactococcus spec, which does not contain a plasmid with an origin of replication of the same incompatibility group. 

In the case that the hybrid vector comprises a homologous or heterologous structural gene fused in a proper 
reading frame with the DNA sequence coding for the MSP signal peptide, a transformed host of the invention is such 
which is suitable for the production of a secreted homologous or heterologous protein, e.g. a strain of L lactis. 

25 An example for a transformed host according to the invention is an E. coli strain, e.g. TG1 , C600 (Huynh et al 

(1985)) or HB101 transformed with pUCRS, pSC12, pSCl2AP, pSC12AN, pSC12ANP, pSC18, pSC18AN, pSC18AP, 
pVAHIR, pVAHIR-1, pSC12HIR, pSC12HIR-1, or pSC12HIRTerm, or L cremoris, or a plasmid-free L. lactis , e.g. L 
lactis LM0230, or a Bacillus strain, e.g. B. thuringiensis, transformed with either of these plasmids. Preferred are E. 
coli TG1 transformed with pUCRS, L. lactis transformed with pUCRS, pSC12 HIRTerm, pSC12 or pSC18, and B, 

30 thuringiensis transformed with pVAHIR or pVAHIR-1 . 

The invention concerns also a method for the preparation of such transformants comprising treatment of a host 
under transforming conditions with a recombinant DNA molecule of the present invention, especially a hybrid vector 
of the invention, optionally together with a selection marker gene and selecting the transformants. 

35 Process for the preparation of polypeptides 

The invention concerns further a method for the preparation of a polypeptide, characterized in that a homologous 
or heterologous structural gene is fused in proper reading frame with the DNA sequence coding for the MSP signal 
peptide, that a suitable host such as a gram positive bacterial host, for example a lactococcal host, e.g. L lactis LM0230, 

40 or Bacillus spec, e.g. B. thuringiensis , is transformed with a hybrid vector comprising such fused gene and that the 
polypeptide encoded by said gene is secreted from the host cells. When required, the polypeptide is isolated from the 
supernatant according to conventional methods. In a preferred embodiment of the invention a bacterial host is used 
which does not secrete a protease in its culture medium which degrades the expressed protein. 

Such a preferred embodiment of the invention is, for example, the production of proteins, e.g. desulfatohirudin, 

45 which are secreted from L. lactis LM0230 cells transformed with a hybrid vector of the invention comprising a gene 
encoding such protein, e.g. pSC12HIR, pSC12HIR-1 or in particular pSC12HlRTerm. Concentrations of active desul- 
fatohirudin can be obtained from supernatant of stationary phase cultures of L lactis LM0230 cells transformed with 
pSC12HIR or pSC12HIR-1. In the supernatant of L. lactis LM0230 cells transformed with pSC12HIRTerm the yield in 
desulfatohirudin is increased. Surprisingly, the levels of heterologous gene products, e.g. desulfatohirudin, do not de- 

50 crease after prolonged incubation, e.g. over night, of the stationary phase cultures indicating absence of proteolytic 
degradation of the heterologous gene products in the supernatant. 

In another embodiment of the invention the cells may be collected from the nutrient medium either in the log phase 
or in the stationary phase of the culture, e.g. by centrifugation or filtration, and resuspended in a smaller volume, e.g. 
in about 1 % up to 20 % of the original culture volume, of fresh nutrient medium or suitable buffer solution. Incubation 

55 of the resuspended cells results in an increased yield of heterologous gene product which is secreted into the super- 
natant. For example, in a culture of L. lactis LM0230 transformed with pSC12HIR an increased yield in desulfatohirudin 
can be obtained in the supernatant if the cells of a stationary phase culture are collected by centrifugation, resuspended 
in about 1/10 vol. of fresh nutrient medium and incubated for about 30 min at about 30°C. 
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Another embodiment of the invention is the production of MSP which is secreted from a gram positive host cell 
transformed with a hybrid vector of the invention, e.g. pUCRS, according to the methods described above. 

A further embodiment of the invention is the production of desulfatohirudin in B. thuringiensis , preferentially in 
strain HD1 cryB (DSM 4574), transformed with pVAHIR or pVAHIR-1. 

5 

Short description of the figures 

Figure 1: Physical map of plasmid pUC838. The map positions are approximate and given in kbp. The pUC 18 
mojety extends from map position 0 up to 2.7 and the pVA838 mojety carrying the erythromycin resistance gene extends 

10 from map position 2.7 up to 4.4. The restriction sites shown in brackets were destroyed during the preparation of the 
recombinant molecule. The meanings of the abbreviations are: amp r , ampicillin resistance gene; ery r , erythromycin 
resistance gene; ori puc , origin of replication derived from pUC 18. 

Figure 2: Physical map of plasmid pSCl2. The map positions are approximate and are given in kbp. pUC18 DNA 
extends from position 0 up to 2.7. The 1.7 kb erythromycin resistance carrying fragment of pVA838 extends from 

15 position 2.7 up to 4.4. The L lactis 2.5 Md plasmid derived insert extends from position 4.4 up to 8.0/0. The restriction 
sites shown in brackets were destroyed during the preparation of the recombinant molecules. The meanings of the 
abbreviations are: PL1 , pUC polylinker region from the EcoRI to the (Smal) site, PL2, pUC polylinker region from the 
(Smal) to the Sphl site; amp r , ampicillin resistance gene; ery r , erythromycin resistance gene; ori puc , origin of replication 
derived from pUC; ori 11s origin of replication derived from lactococcal 2.5 Md plasmid as conducted from deletion 

20 analysis. 

Figure 3: Physical map of plasmid pSC18. The map positions are approximate and are given in kbp. pUC18 DNA 
extends from position 0 up to 2.7. The 1.7 kb erythromycin resistance carrying fragment of pVA838 extends from 
position 2.7 up to 4.4. The L. lactis 5.2 Md plasmid derived insert extends from position 4.4 up to 8.5/0. The restriction 
sites shown in brackets were destroyed during the preparation of the recombinant molecules. The meanings of the 
25 abbreviations are: PL1, pUC polylinker region from the EcoRI to the (Smal) site; PL2, pUC polylinker region from the 
(BamHI) to the Hindlll site; amp r , ampicillin resistance gene; ery r , erythromycin resistance gene; ori puc , origin of rep- 
lication derived from pUC; ori 12 , origin of replication derived from lactococcal 5.2 Md plasmid as conducted from deletion 
analysis. 

30 Abbreviations 

dNTP deoxy nucleotide trisphosphate 

HPLC high performance liquid chromatography 

IPTG isopropyl-(i-D-thiogalactopyranoside 

35 kbp kilo base pairs 

kD kilo Daltons 

LB Luria broth nutrient medium (Gibco/BRL) 

Md Mega Daltons 

sCD23 soluble CD23, i.e. 25 k IgE-binding factor 

40 SLPI Secretory Leucoproteinase Inhibitor 

The following examples serve to illustrate present invention, however, they should not be construed as a limitation 
thereof. 

45 Material and Methods 

Strains 

L. lactis LM0230 is a plasmidfree derivative of L lactis C2 (Efstathiou J.D. and L.L McKay, 1977). It is deposited 
50 with the Deutsche Sammlung von Mikroorganismen und Zellkulturen (see hereinafter). 

L. lactis C2 (NCDO 2031 ), and L. lactis LL71 2 which latter serve as the source of the 2.5 Md and 5.2 Md plasmids, 
are closely related or identical (F.L. Davis et ak, 1981). The former is obtainable, for example, from the National Col- 
lection of Diary Organisms, UK (NCDO 2031), the latter is deposited with the Deutsche Sammlung von Mikroorganis- 
men und Zellkulturen (see hereinafter). 
55 E. coli TG1 has the genotype K1 2, A(lac-pro), supE, thi, hsdD5/F1raD36, proA + B + , lacH, lacZAMI 5 and is obtain- 

able from Amersham; it is described in the manual "Oligonucleotide-directed in vitro mutagenesis system" of Amer- 
sham. 
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General Methods: 
Transformation 

E. coli is transformed according to the calcium chloride method as described in Maniatis et al. (1 982). 
Transformation of lactococcal strains is performed by electroporation using a Gene Pulser® apparatus from BioRad 
and following their protocol for L lactis LM0230. 

Plasmid Preparation 

Plasmid DNA from E. coli is prepared according to the method of Birnboim and Doly (1979). 

The same method is used for the isolation of plasm ids from lactococci with the following modifications: 

a) Overnight cultures are grown in M17-G medium (Terzaghi and Sadine, 1975) at 30*0, diluted 1:10 into fresh 
medium and grown for another two hours. 

b) Incubation with lysozyme is performed also in the presence of mutanolysin (50 mg/J) for 10-20 min at 37°C. 

A plasmid pool from L lactis LL712 is prepared by the same method and the plasmid fraction is further purified by 
CsCI-ethidium bromide equilibrium centrifugation (Clewell and Helinski, 1969). The three smaller plasmids (1.8 Md, 
2.5 Md, 5.2 Md; Gasson, 1983) are enriched in the purified plasmid fraction while the two larger plasmids (9 Md and 
33 Md; Gasson, 1983) are present in smaller quantities. 

Examples: 

1 . Construction of Shuttle Vectors for Lactic Acid Bacteria and E. coli 

1.1 Construction of pUC838 

pVA838 (ATCC 37160; Macrina et al., 1982) is a lactococcal plasmid which carries the gene for constitutively 
expressed erythromycin resistance on a 1.7kbp Aval/Hindlll restriction fragment. About 10 u.g of pVA838 DNA is di- 
gested with Aval and Hindlll. The ends of the DNA fragments are blunt ended with Kleenow enzyme in the presence 
of all four dNTPs. The fragments are separated on an agarose gel and the 1 .7kbp Aval/Hindlll fragment is recovered 
from the gel by electroelution. 

About 200 ng of said fragment and 100 ng of Smal-cut pUC18 (Norrander et al.) are ligated in the presence of 
T4-!igase as suggested by Rusche et al. and the ligation mixture is used to transform E. coli TG1. White colonies are 
picked from LB agar plates containing X-Gal, JPTG and 1 00 mg/l of ampicillin. Correct clones are identified by isolating 
and analyzing their plasmid DNA, and by their ability to grow on LB agar plates containing erythromycin (100 mg/l). 
The resulting plasmid is designated pUC838. The restriction sites and characteristics of pUC838 are given in Figure 1 . 

1.2 Construction of pSC12 

The plasmids in the pool isolated from L. lactis LL712 are separated on a preparative agarose gel. The second 
DNA band from bottom corresponds to the ccc form of the 2.5 Md plasmid. It is cut out and the plasmid is elect roe luted. 
Digestion of this plasmid with Sphl and agarose gel electrophoresis reveales a single 3.5 kbp fragment indicating the 
presence of a single or several closely spaced Sphl cleavage sites in the 2.5 Md plasmid. 

pUC838 is cut at its unique Sphl site and treated with calf intestinal phosphatase. About 50 ng of phosphatase 
treated vector and 150 ng of the 3.5 kbp Sphl fragment of the 2.5 Md plasmid is ligated with T4 ligase and the ligation 
mixture is used to transform competent E. coli TG1 ceils. The transformation mixture is grown at 37°C in 1 ml of LB 
for 90 min and then transferred to 200 ml of LB containing 100 mg/l of erythromycin and grown at 37°C over night. 
Plasmid DNA is prepared from the cells of this culture and about 2 u.g are used to transform L. lactis LM0230. Trans- 
formed L. lactis LM0230 cells are selected at 30°C on M-17G agar plates containing 5 mg/l of erythromycin. 

Plasmid DNA is isolated from several clones of the transformed L. lactis LM0230 cells and subjected to restriction 
enzyme analysis. All transformants investigated contained a 8.0 kbp plasmid, pSC12, from which the 3.5 kbp Sphl 
insert can be recovered. A physical map of pSC12 is given in Figure 2. 

pSC12 is shuttled several times between L. lactis and E. coli without any obvious changes in restriction pattern. 
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1.3 Construction of pSC1B 

Plasmid pool DNA from L. lactis LL712 is partially restricted with Sau3AI by incubating 500 ng aliquots of DNA 
with different amounts of restriction enzyme. Samples containing partially cut DNA are identified on an agarose gel. 
5 This DNA is ligated to pUC838 vector which is previously cut at its unique BamHI site and treated with calf intestinal 
phosphatase. 

Transformation of E. coli TG1 plasmid isolation and subsequent transformation of L. lactis LM0230 is done as 
described for pSC12. Again, restriction digests are performed on plasmid DNA isolated from several clones of L lactis 
LM0230 transformants. They carry a 8.8 kbp plasmid, pSC18. Shuttling the plasmid several times between L. lactis 
10 and E. coli didn't result in any changes in restriction pattern indicating stability of the construct in both cell types. A 
physical map of pSC18 is given in Fig. 3. 

2. Identification of Lactococcal origins of replication 

is 2.1 Deletion Analysis of pSC12 and pSC18 

The regions on plasmids pSC12 and pSC18 carrying the lactococcus derived origins of replication are defined by 
deletion analysis as follows: 

DNA between appropriate restriction sites is deleted by cutting the plasmids with the corresponding restriction 
20 enzyme and recircularization with T4 ligase. The approximate location of the restriction enzyme cleavage sites in 
pSC12 and pSC18 are given in Figures 2 and 3, respectively, and in Table 1 . 

pSC12AP and pSC18AP are constructed by deleting the DNA between the two Pvull sites in the pUC18 part of 
the plasmids pSC12 and pSC18, respectively, thus removing the col El derived origins of replication. 
- In pSC12AN and pSC18AN the smaller Ndel-fragment of pSC12 or pSC18, respectively, is removed. 
25 pSC12AR lacks an EcoRV fragment in the lactococcal DNA of pSC12. 

In pSC12ANP DNA between the Pvull site in pUC and the Ndel site in the lactococcal part of pSC12 is removed. 
In this case the vector DNA is blunted with Kleenow enzyme before religation. 

A further derivative of pSC18, pSC18AH, lacks DNA spanning the Hindi 1 1 sites in the plasmid. 
All the mutated plasmid DNAs are first isolated from E. coli TG1 transformants that are resistant to erythromycin. 
30 it is then tested whether they can be used to transform L lactis LM0230 to erythromycin resistance. Failure to do so 
is assumed to indicate that the lactococcal origin of replication was removed or destroyed by the deletion. When eryth- 
romycin resistant transformants are obtained, the plasmid is reisolated and its structural integrity reconfirmed by re- 
striction analysis. 

For both the 2.5 Md and the 5.2 Md plasmid a region required for replication in streptococcus is identified. From 
35 the fact that the AP-derivatives could replicate in E. coli it is obvious that the cloned lactococcal origins can also function 
in the gram-negative E. coli bacteria. 

Results of these experiments are given in Table 1. 



Table 1: 



Results of the deletion 


analysis of pSC12 and pSC18 






Plasmid 


Deleted region 1 


Deleted fragment 2 


Replication of the plasmid in 


E. coli 


L. lactis 


pSC12 


none 




+ 


+ 


PSC12AP 


Pvull (0.1), Pvull 


2.4 


+ 


+ 




(2.5) 








PSC12AN 


Ndel (0.2), Ndel 


2.0 


+ 


+ 




(6.2) 








PSC12AR 


EcoRV (5.0), 


2.5 


+ 






EcoRV (7.5) 








pSC12ANP 


Ndel (6.2), Pvull 


4.3 


ND 3 


+ 




(2.5) 









1 The deletions extend between two restriction sites. The approximate positions of the restriction sites in pSCl2 or pSCl8 are given in the brackets 
as the distances from the Hindtll site in the pUC838-derived mojety of the plasmids which is defined as position (0). They relate to the base positions 
given in Figures 2 and 3, respectively, 
approximate length of the deleted fragment in kbp. 
3 No1 determined. 
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Table 1: (continued) 



Results ot the deletion analysis of pSC12 and pSC18 


Plasm id 


Deleted region 1 


Deleted fragment 2 


Replication of the plasm id in 


E. coli 


L. lactis 


pSC18 


none 




+ 


+ 


pSC18AP 


Pvull (0.1), Pvull 


2.4 


+ 


+ 




(2-5) 








pSC18AN 


Ndel (0.2), Ndel 


1.2 


+ 


+ 




(7.5) 








pSC18AH 


Hindlll (0), Hindlll 


2.0 


+ 






(6.5) 









•The deletions extend between two restriction sites. The approximate positions of the restriction sites in pSC12 or pSC18 are given in the brackets 



75 as the distances from the Hindlll site in the pUC838-derived mojety of the plasm ids which is defined as position (0). They relate to the base positions 

given in Figures 2 and 3, respectively, 
approximate length of the deleted fragment in kbp. 

2.2 Source of the lactococcal origins of replication 

20 

The origin of replication in pSC1 2 is derived from the 2.5 Md plasm id of L. lactis LL71 2 as is clear from the isolation 
protocol for the DNA fragment carrying the origin. 

The restriction enzyme cleavage pattern indicates that the origin of replication in pSC1S stems from the 5.2 Md 
plasmid. 

25 

2.3 Replication of pSC12 and pSC18 in other Lactic Lactococci 

Strains of L. lactis , L. lactis diacetylactis and L. cremoris are successfully transformed with both plasmids. The 
plasmids remain stably associated with the cell population under selective pressure. 

30 

3. Cloning of the Gene encoding the Major Secretion Product (MSP) of L. Lactis LM 0230 

3.1. isolation of the Major Secreted Protein (MSP) of L. Lactis LM 0230 

3S 1 .5 I of an overnight culture of L lactis LM 0230, grown at 30 °C in M-17G, is centrifuged in a Sorvall GS-3 rotor 

at 7000 rpm for 20 min at 4 °C. The supernatant is collected and proteins precipitated by adding an equal volume of 
ice cold 10 % trichloroacetic acid and incubating for 30 min at room temperature. The precipitate is collected by cen- 
trifugation in a Sorvall GS-3 rotor at 8 000 rpm and 4 °C for 30 min. The protein pellets are drained and redissolved in 

3 ml of SDS-sample buffer (Laemmli, 1970). The sample is neutralized by adding a small amount of 4N NaOH and 
40 then dialyzed for 4 h against 4 I of 25 mM Tris-HCI pH 6.8, 0.02 % SDS. The sample recovered has a volume of about 

4 ml. 3x cone. SDS-sample buffer is added to the sample in order to enlarge the volume to 6 ml. 

Proteins are separated by running 2 ml aliquots on preparative 8 % SDS polyacrylamide gels (Laemmli, 1970) 
using a Protean cell from BioRad. Small strips are cut alongside from the gel, stained with Coomassie Brilliant Blue 
and destained in an aqueous solution containing 10 % of Methanol and 10 % of Acetic acid. They serve as markers 
45 to identify and cut out the major protein band with an apparent molecular weight of about 56 kD which is cut out of the gel. 

The MSP protein is recovered from the gel by electroelution at 1 50 V for 2 h using a Biotrap apparatus and a buffer 
containing 20 mM ammonium acetate and 0.01 % SDS. The eluate is dialyzed for 48 h against two changes of 20 mM 
ammonium acetate, 0.005 % SDS. A sample of the dialyzed protein is run on a 8 % SDS-PAGE and a single sharp 
protein band is observed with an apparent molecular weight of about 56 kD. 

so 

3.2 Analysis of the amino-terminal Sequence of MSP 

Amino-terminal sequencing of the isolated MSP protein is performed according to conventional methods using a 
gas phase sequenator (Applied Biosystems Inc., Model 470 A) with HPLC quantitation of phenylthiohydantoin-deriva- 
55 tives of cleaved amino acid residues. 

The 28 amino acid long sequence obtained is 
X-X-Asn-Ser-Asp-lle-Ala-Lys-Gln-Asp-Ala-Thr-N The 
first two amino acids and the amino acid in position 15 which are indicated with "X" are not determined. 
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3.3 Synthesis of a mixed Oligonucleotide Probe 

Based on the amino acid sequence Asp-lle-Ala-Lys-Gln-Asp-Ala-Thr-lle, which corresponds to the amino acids in 
position 5 to 13 in the N-terminus sequence of MSP, a mixed oligonucleotide is designed and constructed for the 
s screening of a DNA library. Inosine is inserted at the two positions in the 23-mer where the degeneracy of the genetic 
code would require all 4 dNTPs in order to decrease the complexity of the probe. The nucleotide sequence of the 
oligonucleotide mixture is GAN, ATN 2 GCI AAN 3 CAN 3 GAI^ GCI AC. ^ is T or C; N 2 is T, C or A; N 3 is A or G. A 
represents a nucleotide with the base Adenine, T with Thymine, C with Cytidine, G with Guanosine and I with Inosine. 

10 3.4. Construction of a Genomic Library of L lactis LM 0230 

Chromosomal DNA is isolated from L. lactis LM 0230 by a modification of the protocol used for plasmid isolation: 
after treatment with lysozyme and mutanolysin the cells are incubated for two hours at 56°C with proteinase K (100 
mg/l) in 10 mM Tris-HCI pH 8, 20 mM EDTA, 0.5 % SDS. The DNA is extracted once with phenol/chloroform (1 vol:1 
15 vol) and then purified by CsCI density gradient centrifugation. 

Chromosomal DNA is partially digested with Sau3AI and size fractionated on sucrose gradients as described in 
Maniatis. Fragments between about 10-20 kbp are collected. 

X EMBL 3 DNA is cleaved with BamHI and EcoRI and purified by phenol extraction and ethanol precipitation. It is 
then ligated with the 10-20 kbp fragments in the presence of hexamminecobalt(lll)chloride to favour the formation of 
20 concatamers. (Rusche J.R. et al., 1985). The ligation mixture is packaged in vitro using the Gigapack Plus® system 
from Stratagene. 

Recombinants are selected by plating the library on E. coli Q 359 (Kahn J. et al., 1980). A total of about 600 000 
recombinant phages is obtained. 

25 3.5 Screening the Library for DNA sequences encoding MSP 

About 15 000 recombinant X EMBL 3 plaques are plated per petri dish (15 cm diameter) and transferred to Plaque- 
Screen® membranes (NEN). The filters are screened by hybridization with the mixed oligonucleotide probe described 
in Example 3.3 which was labeled with [y^PJATP using T4 Kinase according to standard procedures described in 
30 Maniatis et al. (1982). 

Positive plaques are identified and subjected to a second round of screening at low plaque density. 

DNA is prepared from positive phages, digested with restriction enzymes and subjected to Southern analysis as 
described in Maniatis. Southern blots are probed with the mixed oligonucleotide. It hybridizes to a 3.5 kbp EcoRI/Sall 
fragment in the L. lactis insert of a positive phage. Further probing showed a 2.1 kbp Hindi II fragment within the 3.5 
35 kbp EcoRI/Sall fragment to hybridize with the oligonucleotide mixture. 

The approximate distances of different restriction sites from the EcoRI cut end of the 3.5 kbp EcoRI/Sall fragment 
are determined by agarose gel electrophoresis after digestion of the fragment with the respective enzymes and suitable 
enzyme mixtures. The EcoRI cut end is located about 0.5 kbp away from a Hindi II site, about 2.6 kbp from a second 
Hindi II site, and about 3.15 kbp from a third Hindi 1 1 site. 

40 

3.6 Construction of pUCRS 

DNA from a positive phage is digested with EcoRI and Sail. The DNA fragments are separated on an 0.6 % agarose 
gel and the 3.5 kbp EcoRI/Sall fragment is cut out and isolated by elect roe I ut ion. A three fold molar excess of this 
45 fragment is ligated to pUC19 which is cut with EcoRI and Sail and treated with alkaline phosphatase. 

The ligation mixture is used to transform E. coli TG1 and transformants are selected on plates containing 100 mg/ 
I of ampicillin. Plasmid DNA is prepared from transformants and restriction analyses is performed to identify the correct 
construct which is designated pUCRS. 

E. coli TG1 transformed with pUCRS is deposited with the Deutsche Sammlung von Mikroorganismen und Zel- 
50 Ikulturen (DSM). 

3.7. Sequencing the gene for the Major Secretion Product (MSP) 

Sequencing is done by the chain termination method (Sanger F. et al., 1977) using Sequenase® from USBC. 
55 The 3.5 kbp EcoRI/Sall fragment prepared as described in Example 3.6 is subcloned into M1 3 mp1 8. The resulting 

plasmid is named M13 mpl8RS. The 2.1 kbp Hindlli fragment of the 3.5 kbp EcoRI/Sall fragment is cloned into M13 
mp1 9, the resulting plasmid is M1 3 mpl 9H. A set of unidirectional deletions is generated from the Sail end of the insert 
in M13 mp18RS by digesting the DNA with exonucleaselll and SI (Henikoff, 1984; Yanisch-Perron et al., 1985). The 
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deletions extend about 2 kbp into the insert. Sequencing these deletions from the universal mp18 primer yielded the 
sequence information for one strand. Based on this sequence, a set of synthetic oligonucleotides is synthesized and 
serves as primers to sequence the second strand. Additional sequence information is gained by sequencing exolll-S1 
deletions generated from M13 mp19H with the same method. The DNA sequence of a 1920 bp fragment of the 3.5 

5 kbp EcoRI/Sall fragment is depicted in the sequence listing under SEQ ID No. 1 . It comprises part of the MSP promoter 
region, the DNA sequence encoding the MSP signal peptide and the structural gene for mature MSP. 

Analysis of the about 2 kbp long DNA sequence revealed the presence of a single open reading frame (ORF) 
coding for a protein having 461 amino acid residues. Comparison of the deduced amino acid sequence with the data 
obtained from sequencing the N -term in us of the isolated MSP identified a mature form of the protein with 434 amino 

10 acids which is preceded in frame by 27 amino acids. The peptide formed by the latter amino acids has the structure 
of a typical signal peptide. Charged residues are found at the N-terminus while the rest of the sequence consists mainly 
of hydrophobic amino acids. The amino acid composition around the cleavage site follows the rules deduced from 
other signal peptides (Von Heijne, 1983). 

15 4. Production of recombinant Desuffatohirudin in Lactococcus lactis 

4.1 Construction of Desulfatohirudin Secretion Plasmids 

Plasmid M1 3 mp19H contains a 2.1 kbp Hindi 1 1 fragment encoding the N-terminus half of MSP including the signal 
20 peptide and about 1 .5 kbp of upstream DNA. This plasmid is cleaved at its unique Seal site 1 29 bp downstream of the 
COOH-terminus of the signal peptide. A blunt ended 211 bp DNA fragment encoding the genetic information for des- 
ulfatohirudin is isolated from plasmid pML310 (published in the European Patent Application EP-A-168 342) and in- 
serted at the cleaved Seal site of M1 3 mpl9H. An in frame fusion between the DNA sequence encoding the MSP signal 
peptide and the desulfatohirudin structural gene is created by removing the excess DNA separating the codon for the 
25 last amino acid of the signal peptide (Ala) and the codon for the first amino acid of desulfatohirudin (\fel). This is 
achieved by oliqonucieotide-directed in vitro mutagenesis using the system from Amersham. The 29 bp oligonucleotide 
comprises the last 14 bp of the signal peptide and the first 15 bp of the hirudin gene. The sequence of the resulting 
fusion is depicted in the sequence listing under SEQ ID No. 2. 

The Hindi! I fragment carrying the fusion is cut out and inserted into the unique Hindi 1 1 of pSC1 2. Both orientations 
30 of the insert are recovered. The resulting ptasmids are named pSC12HIR and pSC12HIR-1 , respectively. 

Another plasmid (pSC12HIRTerm) is constructed by inserting the trpA transcription terminator of E. coli (Pharma- 
cia) at the unique Hpal site of pSC12HIR about 125 bp downstream of the desulfatohirudin gene. 

4.2 Secretion of Desulfatohirudin by L. tactis 

35 

The desulfatohirudin fusion plasmids are transformed into L. lactis LM 0230. Transformants are grown in M-17G 
medium supplemented with 2 % of glucose in the presence of erythromycin (5 mg/l) at 30°C. When the cultures reach 
stationary phase, 1.5 ml samples are centrifuged in an Eppendorf® tube. The culture supernatants are removed and 
frozen at -70°C. The cultures are then further grown overnight before a second sample is taken, centrifuged, the su- 
40 pernatant removed and also frozen. The production of secreted desulfatohirudin is determined with a bioassay. 

The bioassay measures the thrombin inhibition activity of desulfatohirudin in the supernatants collected. A detailed 
protocol of the bioassay, which is a thrombin inhibition assay is presented hereinafter: 

The buffer used for making all dilutions of samples and reagents is 0.2M Tris-HCl, pH 7.5 containing 1.0M NaCI 
and 0.01 % bovine serum albumin. Thrombin is from human plasma (Protogen AG, Laufetfingen, product No. 
45 80-13-1102), the chromogenic thrombin substrate is Chromozyme TH (Boehringer, Mannheim, product No 206849). 
The p-nitroaniline released from Chromozym TH is measured with a Dynatech MR 600 microplate reader. All assays 
are performed in microtiter plates (Nunc, MicroWell plates). Each well receives: 50 uJ buffer, 50 uJ supernatant with 
unknown concentration of thrombin inhibitory activity, i.e. desulfatohirudin, and 25 u.l thrombin solution. The reaction 
is started by adding 150 uJ substrate solution (330 p.g/ml) and the plates are incubated for two hours at 37°C. The 
50 concentration of thrombin is adjusted to give an A^nm = 0.8 ± 0.2 in an uninhibited control well. Both substrate and 
thrombin solutions are kept frozen at -20° C and are thawed immediately before being used. A standard curve with 
known concentrations of a [Tyr^desutfatohirudin (40, 20, 10, 5, 2.5 and 1 .25 ng/ml) is used to convert the OD-meas- 
urements into the concentrations of active desulfatohirudin. 

In the supernatant of stationary phase cultures of L. lactis LM0230 cells transformed with either pSC12HIR or 
55 pSC1 2HIR-1 the same level of desulfatohirudin activity is determined. In the supernatant of a stationary phase culture 
of L. lactis LM0230 cells transformed with pSC12HIRTerm, the level of desulfatohirudin activity is increased by about 
50 %. Supernatants of cells transformed as a control with pSC1 2 contain no desulfatohirudin activity. 

The levels of desulfatohirudin in the supernatant did not decrease after prolonged incubation for about 16 h of the 
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stationary phase cultures. These results indicate the absence of proteolytic degradation in the supernatant of L lactis 
LM0230. 

In one experiment, cells are first collected by centrifugation and then resuspended in 1/1 0 volume of fresh medium 
and incubating at 30°C for 30 min expression period. Bioassay measures an about six fold higher level of desulfa- 
5 tohirudin activity than in the experiments described hereinabove. In a more sophisticated form, concentration of the 
cells for an expression period is a way to increase the concentration of secreted product in the supernatant. 

5. Production of recombinant Desulfatohirudin in Bacillus thuringiensis 

io 5.1 Construction of plasmid pVAHIR 

Plasmid pVA838 (Macrina et al, 1982) is digested with Hindi 1 1 and the 5.0 kbp fragment carrying the gene for 
erythromycin resistance and the gram positive origin of replication is isolated from a 0.5 % agarose gel by electroelution. 
The Hindlll fragment containing the desulfatohirudin gene is isolated in a similar manner from pSC12HIR. 
15 The two fragments are ligated in the presence of T4-ligase and the ligation mixture is directly used to transform 

L. lactis LM 0230 by electroporation. Erythromycin resistant transformants are selected and plasmid DNA is prepared 
from them. 

Restriction analyses of these plasm ids is performed to identify correct constructions. Both orientations of the Hindlll 
desulfohirudin expression cassette are obtained and can be used for the production of Desulfatohirudin in Bacillus 
20 thuringiensis. The plasmids are named pVAHIR and pVAHIR-1. 

5.2 Secretion of Desulfatohirudin by B. thuringiensis 

B. thuringiensis strain HID1 cryB (DSM 4574) is transformed with pVAHIR using electroporation (W. Schurter et 
25 al, 1989). 

Transformants are selected on LB plates containing 20 |ig/ml of erythromycin at 27°C or 30°C. Plasmid DNA is 
prepared from individual transformants by the same method as used for L lactis . Restriction analyses reconfirms that 
the structure of the isolated plasmids is identical to pVAHIR. 

Transformants harbouring pVAHIR or pVA838 - as a control - are grown in LB containing 20 mg/l of erythromycin 
30 at 27°C over night. The cultures are diluted 1:200 into fresh medium and grown further at 30°C. The cultures are 
centrifuged and the supernatants are assayed for desulfohirudin activity 7 h after dilution. 

Desulfatohirudin is detected in the supernatant of cells transformed with pVAHIR, while none is measured in the 
control with pVA838. 

35 Deposited Microorganisms 

The following microorganisms are deposited according to the Budapest Treaty with the Deutsche Sammlung von 
Mikroorganismen und Zellkulturen (DSM), Mascheroder Weg 1b, D-3300 Braunschweig: 



Microorganism 


Deposition No. 


Date of deposition 


E. coliK12TG1/pUCRS 
Lactococcus lactis LL712 
Lactococcus lactis LM0230 


DSM 5803 
DSM 5804 
DSM 5805 


February 16, 1990 
February 16, 1990 
February 16, 1990 



45 
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Sequence Listing 
SEQ ID No. 1 

25 SEQUENCE TYPE: Nucleotide with corresponding protein 

SEQUENCE LENGTH: 1 920 base pairs 

STRANDEDNESS: double 

TOPOLOGY: linear 

MOLECULE TYPE: genomic DNA 
30 ORIGINAL ORGANISM SOURCE: Lactococcus lactis LM0230 (DSM 5805) 

IMMEDIATE EXPERIMENTAL SOURCE: Plasmid pUCRS (DSM 5803) 

POSITION IN THE GENOME: chromosomal 

FEATURES: from 

35 1 to 410 part of the promoter region 

411 to 491 MSP signal peptide 
492 to 1793 mature MSP protein 

PROPERTIES: Major secretion product (MSP) gene of L. lactis. 

40 



TTTAGGTATT 


TACGGAATTG 


CGACCTTATT 


GTTCCCACTT 


40 


ATTGCTCTTT 


TTGTATATAA 


TATACAAATA 


ACTATATTTA 


80 


CTAATCGCTG 


GACAAGGCTT 


TTTACAACAA 


TTATTATTGT 


120 


GACCGCTTTT 


GAAGTTTTTA 


GTGCAATCAT 


TATGACAGCT 


160 


TTTGGATTTG 


CCCAACTTCA 


GTTTATCAAA 


TTTGTTGTTT 


200 


ACCAGTTAGC 


GCCTACACTT 


TTGCTCAATA 


TTATCTTAGC 


240 


TGTAGCCTTA 


CAATTCCCTT 


TAGAAATCTT 


TT AC AG ATT A 


280 


AAGAAAAGTC 


ATGTAAGATA 


CAATTAGAAA 


GTGTTTTGTA 


320 
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10 



15 



20 



25 



30 



40 



45 



50 



55 



ATCATAAAGA AATATTAAGG TGGGGTAGGA ATAGTATAAT 360 

ATGTTTATTC AACCGAACTT AATGGGAGGA AAAATTAAAA 4 00 

AAGAACAGTT ATG AAA AAA AAG ATT ATC TCA GCT 434 
Met Lys Lys Lys He He Ser Ala 



ATT TTA ATG TCT ACA GTG ATA CTT TCT GCT GCA .4 67 

He Leu Met Ser Thr Val lie Leu Ser Ala Ala 

10 15 

GCC CCG TTG TCA GGT GTT TAC GCT GAC ACA AAC 500 

Ala Pro Leu Ser Gly Val Tyr Ala Asp Thr Asn 

20 25 30 

TCA GAT ATT GCT AAA CAA GAT GCG ACA ATT TCA 5 33 

Ser Asp He Ala Lys Gin Asp Ala Thr He Ser 

35 40 

AGC GCG CAA TCT GCT AAA GCA CAA GCA CAA GCA 5 66 

Ser Ala Gin Ser Ala Lys Ala Gin Ala Gin Ala 

45 50 

CAA GTT GAT AGC TTG CAA TCA AAA GTT GAC AGC 599 

Gin Val Asp Ser Leu Gin Ser Lys Val Asp Ser 

55 60 

TTA CAA CAA AAG CAA ACA AGT ACT AAA GCA CAA 632 

Leu Gin Gin Lys Gin Thr Ser Thr Lys Ala Gin 

65 70 

ATC GCT AAA ATC GAA AGC GAA CGT AAA GCA CTT 665 

He Ala Lys He Glu Ser Glu Arg Lys Ala Leu 

75 80 85 

AAT GCT CAA ATT GCT ACT TTG AAC GAA AGT ATC 698 

Asn Ala Gin He Ala Thr Leu Asn Glu Ser He 

90 100 

AAA GAA CGT ACA AAG ACA TTG GAA GCT CAA GCA 731 

Lys Glu Arg Thr Lys Thr Leu Glu Ala Gin Ala 

105 110 
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CGT AGT GCT CAA GTT 
Arg Ser Ala Gin Val 

115 

TAT ATG GAT GCT GTT 
Tyr Met Asp Ala Val 

125 

ACA GAT GTT ATT CAA 
Thr Asp Val He Gin 

135 

ACT GTT TCT AGT GCC 
Thr Val Ser Ser Ala 

150 

CAA CAA GAA AAA GAG 
Gin Gin Glu Lys Glu 

160 

AAG TCA GAA ACT GTT 
Lys Ser Glu Thr Val 

170 

TTC GTT TCT CTT TCA 
Phe Val Ser Leu Ser 

180 

GCT CAA GAA TTG ACT 
Ala Gin Glu Leu Thr 

190 

AAA GTT GCG ACT TTG 
Lys Val Ala Thr Leu 

205 

GCA ACT GCG CAA GAT 
Ala Thr Ala Gin Asp 

215 



AAC AGC TCA GCA ACA AAT 
Asn Ser Ser Ala Thr Asn 

120 

GTT AAT TCA AAA TCT TTG 
Val Asn Ser Lys Ser Leu 

130 

AAA GTA ACA GCT ATT GCT 
Lys Val Thr Ala He Ala 

140 145 

AAC AAA CAA ATG TTG GAA 
Asn Lys Gin Met Leu Glu 

155 

CAA AAA GAG CTT AGC CAA 
Gin Lys Glu Leu Ser Gin 

165 

AAA AAG AAC TAC AAC CAG 
Lys Lys Asn Tyr Asn Gin 

175 

CAA AGT TTG GAT TCT CAA 
Gin Ser Leu Asp Ser Gin 

185 

TCA CAA CAA GCT GAA CTC 
Ser Gin Gin Ala Glu Leu 

195 200 

AAC TAT CAA GCA ACA ATT 
Asn Tyr Gin Ala Thr He 

210 

AAA AAA CAA GCT TTA TTA 
Lys Lys Gin Ala Leu Leu 

220 
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GAT GAA AAA GCA GCT GCA GAA AAA GCA GCT CAA 1094 
Asp Glu Lys Ala Ala Ala Glu Lys Ala Ala Gin 

s 225 230 

GAA GCA GCT AAA AAA CAA GCG GCT TAT GAA GCT 1127 
Glu Ala Ala Lys Lys Gin Ala Ala Tyr Glu Ala 

10 235 240 

CAA CAA AAA GAA GCA GCA CAA GCA CAA GCA GCT 1160 
Gin Gin Lys Glu Ala Ala Gin Ala Gin Ala Ala 

is 245 250 255 

TCA ACA GCA GCA ACT GCT AAA GCT GTA GAA GCA 1193 
Ser Thr Ala Ala Thr Ala Lys Ala Val Glu Ala 

20 260 265 

GCA ACT TCA TCA GCT TCT GCT TCA TCT AGT CAA 122 6 

Ala Thr Ser Ser Ala Ser Ala Ser Ser Ser Gin 



25 



30 



35 



40 



45 



SO 



55 



270 275 

GCT CCA CAA GTA AGT ACA AGC ACT GAT AAT ACA 125 9 

Ala Pro Gin Val Ser Thr Ser Thr Asp Asn Thr 

280 285 

ACA TCA AAT GCT AGT GCC TCA AAC AGT TCT AAT 1292 
Thr Ser Asn Ala Ser Ala Ser Asn Ser Ser Asn 

290 295 

AGT TCA TCA AAC TCA AGT TCA AGT TCT AGC AGT 1325 
Ser Ser Ser Asn Ser Ser Ser Ser Ser Ser Ser 

300 305 310 

TCA TCA AGC TCA AGC TCA AGC TCA AGT AAT TCT 1358 
Ser Ser Ser Ser Ser Ser Ser Ser Ser Asn Ser 

315 320 

AAT GCT GGT GGG AAT ACA AAT TCA GGC ACT AGT 1391 
Asn Ala Gly Gly Asn Thr Asn Ser Gly Thr Ser 

325 330 

ACT GGA AAT ACT GGA GGA ACA ACT ACT GGT GGT 1424 
Thr Gly Asn Thr Gly Gly Thr Thr Thr Gly Gly 

335 340 
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AGC GGT ATA AAT AGT 
Ser Gly lie Asn Ser 

345 

TAT GCT GTT GGT GGA 
Tyr Ala Val Gly Gly 

355 

CAA TAC TTT GCT GCA 
Gin Tyr Phe Ala Ala 

370 

AAT ATC ATG CCT GGT 
Asn lie Met Pro Gly 

380 

TCT AAT GGA CCT GCC 
Ser Asn Gly Pro Ala 

390 

GTA GGA GCT GCT CCT 
Val Gly Ala Ala Pro 

400 

TTC TCA GCT GAT TTT 
Phe Ser Ala Asp Phe 

410 

CCT TAC GGT CAC GTA 
Pro Tyr Gly His Val 

425 

AAT TCA GAT GGT ACA 
Asn Ser Asp Gly Thr 

435 

GGA TAT GGT ACA ACT 
Gly Tyr Gly Thr Thr 

445 

ACT GTA AGT GCG TCT 
Thr Val Ser Ala Ser 

455 



TCA CCA ATT GGA AAT CCT 
Ser Pro lie Gly Asn Pro 

350 

TGT ACT GAC TAT GTA TGG 
Cys Thr Asp Tyr Val Trp 

360 365 

CAA GGA ATT TAT ATC AGA 
Gin Gly lie Tyr lie Arg 

375 

AAT GGT GGA CAA TGG GCT 
Asn Gly Gly Gin Trp Ala 

385 

CAA GGC GTG CTC CAT GTT 
Gin Gly Val Leu His Val 

395 

GGT GTT ATC GCA TCA AGC 
Gly Val He Ala Ser Ser 

405 

GTT GGA TAT GCA AAC TCA 
Val Gly Tyr Ala Asn Ser 

415 420 

GCT ATT GTA AAA TCA GTT 
Ala He Val Lys Ser Val 

430 

ATT ACT ATC AAA GAA GGC 
He Thr He Lys Glu Gly 

440 

TGG TGG GGA CAT GAA CGT 
Trp Trp Gly His Glu Arg 

450 

GGT GTT ACT TTC TTG ATG 
Gly Val Thr Phe Leu Met 

460 
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CCA AAC TAG AAAAAAGTCT TAATAAATAA AAAATAGTGG 
Pro Asn 

465 

TTTGATAGTG GGGAATAATT TTCCTTCTGT CAAATCATTT 
TTTATTATTG TGGTATAATA ATAAGGAAAA ATGATAAGGG 
GATAGATACA AATG 

SEQIDNo.: 2 

SEQUENCE TYPE: Nucleotide with corresponding polypeptide 
SEQUENCE LENGTH: 279 bp 
STRANDEDNESS: double 
TOPOLOGY: linear 
MOLECULE TYPE: recombinant 
ORIGINAL SOURCE 

ORGANISMS: L. Lactis LM0230 (DSM 5805), Hirudo medicinalis 
IMMEDIATE EXPERIMENTAL SOURCE: Plasmid pUCRS (DSM 5803), plasmid P ML310 (see European pat- 
ent application EP-A-168 342) 
FEATURES: from 

1 to 81 bp MSP signal peptide 

82 to 279 bp coding region for desulfatohirudin 

PROPERTIES: Fusion of the L. Lactis LM0230 DNA encoding the MSP signal peptide and of the hirudin 
structural gene for the production of secreted hirudin in bacteria. 



ATG AAA AAA AAG ATT ATC TCA GCT ATT TTA ATG TCT 36 
Met Lys Lys Lys lie He Ser Ala He Leu Met Ser 
-25 -20 



ACA GTG ATA CTT TCT GCT GCA GCC CCG TTG TCA GGT 72 
Thr Val He Leu Ser Ala Ala Ala Pro Leu Ser Gly 
-15 -10 -5 

GTT TAC GCT GTT GTT TAC ACC GAC TGC ACC GAA TCT 108 
Val Tyr Ala Val Val Tyr Thr Asp Cys Thr Glu Ser 
1 5 



1826 

1866 
1906 
1920 
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GGT CAG AAC CTG TGC CTG TGC GAA GGT TCT AAC GTT 
Gly Gin Asn Leu Cys Leu Cys Glu Gly Ser Asn Val 
10 15 20 



144 



TGC GGT CAG GGT AAC AAA TGC ATC CTG GGT TCT GAC 
Cys Gly Gin Gly Asn Lys Cys He Leu Gly Ser Asp 
25 30 



180 



GGT GAA AAA AAC CAG TGC GTT ACC GGC GAA GGT ACC 
Gly Glu Lys Asn Gin Cys Val Thr Gly Glu Gly Thr 
35 40 45 



216 



CCG AAA CCG CAG TCT CAC AAC GAC GGT GAC TTC GAA 
Pro Lys Pro Gin Ser His Asn Asp Gly Asp Phe Glu 

50 55 



252 



GAA ATC CCG GAA GAA TAC CTG CAG TAG 
Glu He Pro Glu Gin Tyr Leu Gin 
60 65 



279 



Claims 



1. hybrid vector comprising 

a) the approximately 3.5 kbp EcoRI/Sall L. lactis insert of the plasmid pUCRS deposited under the accession 
number DSM 5803, or a functional fragment thereof retaining promoter, signal or structural functions, or 

b) a DNA sequence which hybridizes with said insert or with a functional fragment thereof, or such a hybridizing 
DNA sequence operatively linked to its natural promoter region or 

c) a degenerate sequence of the DNA sequence covered in a) which encodes a signal peptide, 

2. A hybrid vector according to claim 1 comprising the origin of replication of a plasmid belonging to the same incom- 
patibility group as the 2.5 Md or the 5.2 Md plasmid of L lactis strain LL71 2 deposited under the accession number 
DSM 5804, which origin can be obtained from the approximately 1.8 kbp long Ndel/Sphl fragment of the 2.5 Md 
plasmid of L. lactis LL712 or the 1.0 kbp Ndel/Hindlll fragment of the 5.2 Md plasmid of L. lactis LL712. 

3. A hybrid vector according to claim 1 or 2 comprising the approximately 3.5 kbp EcoRI/Sall L. lactis insert of the 
plasmid pUCRS, deposited under the accession number DSM 5803, or a functional fragment thereof. 

4. A hybrid vector according to claim 1 or 2 in which said functional fragment retains the promoter functions of the 
major secretory protein gene. 

5. A hybrid vector according to claim 4 in which the functional fragment retaining promoter function spans the region 
of 1500 bp upstream of the DNA sequence encoding the signal peptide. 
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6. A hybrid vector according to claim 5 in which the functional fragment retaining promoter function spans a region 
between 100 bp up to 1000 bp upstream of the DNA sequence encoding the signal peptide. 

7. A hybrid vector according to claim 1 or 2 in which said functional fragment retains the function of the major secretory 
5 protein signal sequence. 

8. A hybrid vector according to claim 7 in which the functional fragment retaining the signal function encodes the 27 
amino acid long signal sequence disclosed in SEQ ID No. 2. 

10 9. A hybrid vector according to claim 7 in which the functional fragment retaining the signal function extends from a 
base in about position 411 of the DNA sequence with SEQ ID No. 1 up to about position 491 . 

1 0. A hybrid vector according to claim 3 in which said functional fragment retains the promoter and the signal functions 
of the major secretory protein gene. 

15 

11. A hybrid vector according to claim 10 in which the functional fragment is selected from the group of fragments 
extending from any base between the EcoRI cut end of the approximately 3.5 kbp EcoRI/Sall L. lactis insert of the 
plasmid pUCRS deposited under the accession number DSM 5803 and the first Hindlll restriction site located in 
said insert upstream of the base corresponding to position 1 of the DNA sequence with SEQ ID No. 1 . up to about 

20 the base corresponding to position 491 of the DNA sequence with SEQ ID No. 1 . 

12. A hybrid vector according to claim 3 in which the functional fragment extends from a base in about position 492 
up to about position 1793 of the DNA sequence with SEQ ID No. 1 . 

25 13. A hybrid vector according to claim 1 or 2 comprising a DNA sequence which hybridizes with the approximately 3.5 
kbp EcoRI/Sall L. lactis insert of the plasmid pUCRS, deposited under the accession number DSM 5803, or with 
a functional fragment thereof, or such a hybridizing DNA sequence operatively linked to its natural promoter region. 

14. A hybrid vector according to claim 13 comprising a DNA gene sequence hybridizing to the gene encoding the 
30 major secretory protein of L. lactis or to a functional fragment thereof. 

15. A hybrid vector according to claim 1 comprising a DNA sequence which encodes a signal peptide according to 
claim 1c). 

35 16. A hybrid vector according to claim 1 or 2 comprising a homologous or heterologous structural gene which is op- 
eratively linked in a proper reading frame to the DNA sequence encoding the signal peptide of the major secretory 
protein and a promoter. 

17. A hybrid vector according to claim 16 encoding desulfatohirudin or the major secretory protein of L. Lactis strain 
40 LM 0230 as described in SEQ ID No. 1 . 

18. A DNA molecule which 

a) is the approximately 3.5 kbp EcoRI/Sall L. lactis insert of the plasmid pUCRS deposited under the accession 
45 number DSM 5803, or a functional fragment thereof retaining promoter, signal or structural functions, or 

b) hybridizes with said insert or with a functional fragment thereof, or is such a hybridizing DNA sequence 
operatively linked to its natural promoter region, or 

so c) is a degenerate sequence of the DNA sequence covered in a) which encodes a signal peptide. 

19. A hybrid DNA molecule according to claim 18 comprising the origin of replication of a plasmid belonging to the 
same incompatibility group as the 2.5 Md or the 5.2 Md plasmid of L. lactis strain LL712 deposited under the 
accession number DSM 5804, which origin can be obtained from the approximately 1.8 kbp long Ndel/Sphl frag- 

55 ment of the 2.5 Md plasmid of L. lactis LL71 2 or the 1 .0 kbp Ndel/Hindlll fragment of the 5.2 Md plasmid of L. lactis 

LL712. 

20. A host transformed with a hybrid vector according to claim 1 or 2. 
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21. A transformed host according to claim 20 : wherein the host is selected from the group consisting of E. coli strains 
TG1, C600 and HB101; L. cremoris strains; plasmid-free L lactis strains and Bacillus strains. 

22. A transformed host according to claim 20 transformed with a hybrid vector according to claim 17. 

s 

23. A method for the preparation of a transformed host according to claim 20 comprising the steps of (a) treating a 
host under transforming conditions with a hybrid vector of claim 1 or 2, optionally together with a selection marker 
gene, and (b) selecting the transformants. 

to 24. A method for the preparation of a polypeptide comprising the steps of (a) fusing a homologous or heterologous 
structural gene in proper reading frame with the DNA sequence coding for the signal peptide according to claim 
9, (b) transforming a suitable host with a hybrid vector comprising a fused gene prepared in (a), (c) secreting the 
polypeptide encoded by said fused gene from the host cells and, when required, (d) isolating the polypeptide from 
the supernatant according to conventional methods. 

15 

25. A method according to claim 24 in which a host selected from the group of hosts consisting of Lactococcus spec, 
and Bacillus spec, is used. 

26. A method according to claim 24 in which a bacterial host is used which does not secrete in its culture medium a 
20 protease which degrades the expressed protein. 

27. A method according to claim 24 in which desulfatohirudin is produced. 

28. A method according to claim 27 wherein cells of L. lactis strain LM0230 as deposited under the accession number 
25 DSM 5805 are used. 

29. A method according to claim 24 characterized in that (a) the transformed cells are collected from the nutrient 
medium either in the log phase or in the stationary phase of the culture, (b) they are resuspended in a smaller 
volume of fresh nutrient medium or suitable buffer solution and (c) they are further incubated before isolating the 

30 desired product from the supernatant. 

30. A method according to claim 29 characterized in that the transformed cells are resuspended in about 1 % up to 
20 % of the original culture volume. 

35 31 . A method according to claim 30 wherein the transformed cells are cells from L. lactis strain LM0230 as deposited 
under the accession number DSM 5805 transformed with a hybrid vector according to claim 17 encoding desul- 
fatohirudin and are resuspended in about 1/10 vol. of fresh nutrient medium and are incubated for about 30 min 
at about 30°C. 

40 32. A method according to claim 24 in which the major secretory protein of L. lactis is produced. 

33. The major secretory protein of L lactis in pure form having the amino acid given in SEQ ID No. 1 . 

34. A method for the production of the protein according to claim 33 comprising the steps of (a) precipitating the proteins 
45 from the supernatant of L. lactis strain LM0230 as deposited under the accession number DSM 5805 with trichlo- 
roacetic acid, (b) electrophoresis of the precipitated protein on a SDS polyacrylamide gel, (c) cutting out the region 
with the major protein band and (d) electroeluting the protein from the gel. 



so Patentanspruche 

1 . Hybridvektor, der aufweist: 

a) das etwa 3,5-kbp-EcoRI/Sall-L-lactis-lnsert des Plasmids pUCRS, das unter der Zugangs-IMr. DSM 5803 
55 hinterlegt ist, Oder ein funktionelles Fragment davon, das die Promotor-, Signal- oder Strukturfunktionen bei- 

behalt, oder 

b) eine DNA-Sequenz, die mrt dem Insert oder mit einem funktionellen Fragment davon hybridisiert oder eine 
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derartige hybridisierende DNA-Sequenz, die betriebsbereit an ihre naturliche Promotor-Region gebunden ist, 
oder 

c) eine degenerierte Sequenz der in a) angegebenen Sequenz, die ein Signalpeptid kodiert. 

5 

2. Hybridvektor nach Anspruch 1, der den Replikationsursprung eines Plasmids enthalt, das zur gleichen Inkompa- 
tibilitatsgruppe wie das 2,5-Md- oder das 5 : 2-Md-Plasmid des L.-lactis-Stammes LL712, der unter der Zugangs- 
Nr. DSM 5804 hinterlegt ist, gehort, wobei der Ursprung aus dem etwa 1,8 kbp langen Ndel/Sphl -Fragment des 
2,5-Md-Plasmids von L. lactis LL712 oder dem 1 ,0-kbp-Ndel/Hindlll-Fragment des 5,2-Md-Plasmids von L lactis 

10 LL712 erhaltlich ist. 

3. Hybridvektor nach Anspruch 1 oder 2, der das etwa 3,5-kbp-EcoRI/Satl-L-lactis-lnsert des Plasmid pUCRS, das 
unter der Zugangs-Nr. DSM 5803 hinterlegt ist, oder ein funktionelles Fragment davon enthalt. 

15 4. Hybridvektor nach Anspruch 1 oder 2, wobei das funktionelle Fragment die Promotorf unktionen des Hauptsekre- 
tionsproteingens beibehalt. 

5. Hybridvektor nach Anspruch 4, wobei das die Promotorf unktion beibehaltende funktionelle Fragment die Region 
1500 bp stromaufwarts der das Signalpeptid kodierenden DNA-Sequenz uberspannt. 

20 

6. Hybridvektor nach Anspruch 5, wobei das die Promotorf unktion beibehaltende funktionelle Fragment eine Region 
zwischen 100 bp bis zu 1000 bp stromaufwarts der das Signalpeptid kodierenden DNA-Sequenz uberspannt. 

7. Hybridvektor nach Anspruch 1 oder 2, wobei das funktionelle Fragment die Funktion der Hauptsekretionsprotein- 
25 signalsequenz beibehalt. 

8. Hybridvektor nach Anspruch 7, wobei das die Signalfunktion beibehaltende funktionelle Fragment die in SEQ-ID- 
Nr. 2 offenbarte 27 Aminosauren lange Signalsequenz kodiert. 

30 9. Hybridvektor nach Anspruch 7, wobei das die Signalfunktion beibehaltende funktionelle Fragment sich von einer 
Base bei etwa der Position 411 der DNA-Sequenz mit der SEQ-ID-Nr. 1 bis etwa zur Position 491 erstreckt. 

1 0. Hybridvektor nach Anspruch 3, wobei das funktionelle Fragment die Promotor- und die Signalf unktionen des Haupt- 
sekretionsproteingens beibehalt. 

35 

11. Hybridvektor nach Anspruch 10, wobei das funktionelle Fragment aus der Gruppe von Fragmenten ausgewahlt 
ist, die sich von einer beliebigen Base zwischen dem EcoRI-Schnittende des etwa 3,5-kbp-EcoRI/Sall-L-lactis- 
Inserts des unter der Zugangs-Nr. DSM 5803 hinterlegten Plasmids pUCRS und der ersten Hindlll-Restriktions- 
stelle, die sich in diesem Insert stromaufwarts von der Base befinden, die Position 1 der DNA-Sequenz mit der 

40 SEQ-ID-Nr. 1 entspricht, bis etwa zu der Base, die der Position 491 der DNA-Sequenz mit der SEQ-ID-Nr. 1 

entspricht, erstrecken. 

12. Hybridvektor nach Anspruch 3, wobei sich das funktionelle Fragment von einer Base bei etwa der Position 492 
bis etwa zur Position 1793 der DNA-Sequenz mit SEQ-ID-Nr. 1 erstreckt. 

45 

13. Hybridvektor nach Anspruch 1 oder 2, der eine DNA-Sequenz, die mit dem etwa 3,5-kbp-EcoRI/Sall-L-lactis-lnsert 
des unter der Zugangs-Nr. DSM 5803 hinterlegten Plasmids pUCRS, oder mit einem f unktionellen Fragment davon, 
hybridisiertoder eine an ihre naturliche Promotor-Region betriebsbereit gebundene derartige hybridisierende DNA- 
Sequenz enthalt. 

so 

14. Hybridvektor nach Anspruch 1 3, der eine DNA-Gensequenz enthalt, die mit dem das Hauptsekretionsprotein von 
L. lactis kodierenden Gen oder einem funktionellen Fragment davon hybridisiert. 

15. Hybridvektor nach Anspruch 1, der eine DNA-Sequenz enthalt, die ein Signalpeptid nach Anspruch 1 c) kodiert. 

55 

16. Hybridvektor nach Anspruch 1 oder 2, der ein homologes oder heterologes Strukturgen enthalt, das betriebsbereit 
in einem richtigen Leseraster zu der das Signalpeptid des Hauptsekretionsproteins kodierenden DNA-Sequenz 
und einem Promotor gebunden ist. 
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1 7. Hybridvektor nach Anspruch 1 6 : der Desulfatohirudih Oder das Hauptsekretionsprotein des L.-Lactis-Stammes LM 
0230, wie in der SEQ-ID-Nr. 1 beschrieben, kodiert. 

18. DNA-Molekul, welches: 

5 

a) das etwa 3,5-kbp-EcoRI/Sall-L-lactis-lnsert des Plasmids pUCRS, das unter der Zugangs-Nr. DSM 5803 
hinterlegt ist, oder ein funktionelles Fragment davon, das die Promoter-, Signal- Oder Strukturfunktionen bei- 
behalt, ist oder 

io b) mit dem Insert oder mit einem funktionellen Fragment davon hybridisiert oder eine derartige hybrid is ie rend e 

DNA-Sequenz ist, die betriebsbereit an ihre naturliche Promotor-Region gebunden ist, oder 

c) eine degenerierte Sequenz der in a) angegebenen Sequenz, die ein Signalpeptid kodiert, ist. 

is 19. Hybrid-DNA-Molekul nach Anspruch 18, das den Replikationsursprung eines Plasmids enthalt, das zur gleichen 
Inkompatibilitatsgruppe wie das 2,5-Md- oder das 5,2-Md-Plasmid des L.-lactis-Stammes LL712, der unter der 
Zugangs-Nr. DSM 5804 hinterlegt ist, gehort, wobei der Ursprung aus dem etwa 1,8 kbp langen Ndel/Sphl -Frag- 
ment des 2,5-Md-Plasmids von L lactis LL712oderdem 1 ,0-kbp-Ndel/Hindl II -Fragment des5,2-Md-Plasmidsvon 
L lactis LL712 erhaltlich ist. 

20 

20. Wirt, der mit einem Hybridvektor nach Anspruch 1 oder 2 transformiert ist. 

21. Transform ierter Wirt nach Anspruch 20, wobei der Wirt unter den E.-coli-Stammen TG1, C600 und HB101, L- 
cremoris-Strammen, plasmidfreien L-lactis-Stammen und Bacillus-Stammen ausgewahlt ist. 

2S 

22. Transform ierter Wirt nach Anspruch 20, der mit einem Hybridvektor nach Anspruch 1 7 transformiert ist. 

23. Verfahren zur Herstellung eines transformierten Wirts nach Anspruch 20, bei dem (a) ein Wirt unter transformie- 
renden Bedingungen mit einem Hybridvektor nach Anspruch 1 oder 2, gegebenenfalls zusammen mit einem Se- 

30 lektionsmarkergen, behandelt und (b) die Transformanden selektiert werden. 

24. Verfahren zur Herstellung eines Polypeptids, bei dem (a) ein homologes oder heterologes Strukturgen im richtigen 
Leseraster mit der das Signalpeptid kodierenden DNA-Sequenz nach Anspruch 9 fusioniert, (b) ein geeigneter 
Wirt mit einem ein in (a) hergestelltes Fusionsgen enthaltenden Hybridvektor transformiert, (c) das von dem Fu- 

35 sionsgen kodierte Polypeptid von den Wirtszellen sezerniert und, sofern erforderlich, (d) das Polypeptid nach 

herkommlichen Verfahren aus dem Uberstand isoliert wird. 

25. Verfahren nach Anspruch 24, wobei ein unter Lactococcus-Arten und Bacillus-Arten ausgewahlter Wirt verwendet 
wird. 

40 

26. Verfahren nach Anspruch 24, wobei ein bakterieller Wirt verwendet wird, der in sein Kulturmedium keine das 
exprimierte Protein abbauende Protease sezerniert. 

27. Verfahren nach Anspruch 24, wobei Desulfatohirudin gebildet wird. 

45 

28. Verfahren nach Anspruch 27, wobei Zellen des unter der Zugangs-Nr. DSM 5805 hinterlegten L.-lactis-Stammes 
LM0230 verwendet werden. 

29. Verfahren nach Anspruch 24, 
50 dadurch gekennzeichnet, daG 

(a) die transformierten Zellen entweder in der log-Phase oder in der stationaren Phase der Kultur aus dem Nahr- 
medium gesammelt werden, (b) sie in einem kleineren vblumen frischen Nahrmediums Oder geeigneter Pufferlo- 
sung resuspendiert und (c) sie vor der Isolierung des gewunschten Produktes aus dem Uberstand weiter inkubiert 
werden. 

55 

30. Verfahren nach Anspruch 29, 
dadurch gekennzeichnet, daB 

die transformierten Zellen in etwa 1 % bis zu 20 % des ursprunglichen Kulturvolumens resuspendiert werden. 
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31. Verfahren nach Anspruch 30, wobei die transformierten Zellen Zellen des unter der Zugangs-Nr. DSM 5805 hin- 
terlegten L.-lactis-Stammes LM0230 sind, die mit einem Desulfathohirudin kodierenden Hybridvektor nach An- 
spruch 17 transformiert sind und in etwa 1/10 Vol. frischem Nahrmedium resuspendiert und etwa 30 min bei etwa 
30*0 inkubiert werden. 

5 

32. Verfahren nach Anspruch 24, wobei das Hauptsekretionsprotein von L. lactis gebildet wird. 

33. Hauptsekretionsprotein von L. lactis in reiner Form mit der in der SEQ-ID-Nr. 1 angegebenen Aminosauresequenz. 

10 34. Verfahren zur Herstellung des Proteins nach Anspruch 33, bei dem (a) die Proteine aus dem Uberstand des unter 
der Zugangs-Nr. DSM 5805 hinterlegten L-lactis-Stammes LM0230 mit Trichloressigsaure ausgefallt, (b) das aus- 
gefallte Protein auf einem SDS-Polyacrylamidgel elektrophoretisiert, (c) die Region mit der Hauptproteinbande 
ausgeschnitten und (d) das Protein aus dem Gel elektroeluiert wird. 

15 

Revendications 

1. Vecteur hybride comprenant 

20 a) le segment d'insertion de L lactis EcoRI/Sall d'approximativement 3,5 kpb du plasmide pUCRS, depose 

sous le n° de depot DSM 5803, ou un de ses fragments fonctionnels conservant les fonctions de promoteur, 
de signal ou des fonctions structurales, ou 

b) une sequence d'ADN qui s'hybride avec ledit segment d'insertion ou avec un fragment fonctionnel de ce 
segment ou une telle sequence d'ADN hybridante liee de maniere fonctionnelle a sa region de promoteur 

25 naturelle, ou 

c) une sequence degener6e d'une sequence d'ADN qui repond a la definition en a) et qui code pour un peptide- 
signal. 

2. Vecteur hybride suivant la revendication 1 , comprenant I'origine de replication d'un plasmide appartenant au meme 
30 groupe d'incompatibilite que le plasmide de 2,5 Md ou de 5,2 Md de la souche L lactis LL71 2 deposee sous le n° 

de depot DSM 5804, origine qui peut etre obtenue a partir du fragment Ndel/Sphl d'approximativement 1,8 kpb 
de longueur du plasmide de 2,5 Md de L lactis LL712 ou du fragment Ndel/Hindlll de 1 ,0 kpb du plasmide de 5,2 
Md de L lactis LL71 2. 

35 3. Vecteur hybride suivant la revendication 1 ou 2, comprenant le segment d'insertion de L. lactis EcoRI/Sall d'ap- 
proximativement 3,5 kpb du plasmide pUCRS depose sous le n° de depot DSM 5803, ou un de ses fragments 
fonctionnels. 

4. Vecteur hybride suivant la revendication 1 ou 2, dans lequel le fragment fonctionnel conserve les fonctions de 
40 promoteur du gene de proteine secretoire majeure. 

5. Vecteur hybride suivant la revendication 4, dans lequel le fragment fonctionnel conservant la fonction de promoteur 
s'etend sur la region de 1500 pb en amont de la sequence d'ADN codant pour le peptide-signal. 

45 6. Vecteur hybride suivant la revendication 5, dans lequel le fragment fonctionnel conservant la fonction de promoteur 
s'etend sur une region allant de 100 pb jusqu'a 1000 pb en amont de la sequence d'ADN codant pour le pepti&e- 
signal. 

7. Vecteur hybride suivant la revendication 1 ou 2, dans lequel le fragment fonctionnel conserve la fonction de la 
so sequence signal de la proteine secretoire majeure. 

8. Vecteur hybride suivant la revendication 7, dans lequel le fragment fonctionnel conservant la fonction de signal 
code pour la sequence-signal de 27 amino-acides de longueur decrite dans la SEQ ID n° 2. 

55 9. Vecteur hybride suivant la revendication 7, dans lequel le fragment fonctionnel conservant la fonction de signal 
s'etend d'une base approximativement a la position 411 de la sequence d'ADN ayant la SEQ ID n° 1 jusqu'a 
approximativement la position 491. 
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10. Vecteur hybride suivant la revendication 3 : dans lequel le fragment lonctionnel conserve la fonction de promoteur 
et la fonction de signal du gene de la proteine secretoire majeure. 

11. Vecteur hybride suivant la revendication 10, dans lequel le fragment fonctionnel est choisi dans le groupe de 
5 fragments s'Stendant de n'importe quelle base entre I'extrSmitS coupee EcoRI du segment d'insertion de L lactis 

EcoRI/Sall d'approximativement 3,5 kpb du plasmide pUCRS depose sous le n° de depot DSM 5803 et le premier 
site de restriction Hindi II situe dans ledrt segment d' insertion en amont de la base correspondant a la position 1 
de la sequence d'ADN ayant la SEQ ID n° 1 jusqu'a approximativement la base correspondant a la position 491 
de la sequence d'ADN ayant la SEQ ID n° 1. 

10 

12. Vecteur hybride suivant la revendication 3, dans lequel le fragment fonctionnel s'etend d'une base approximative- 
ment a la position 492 jusqu'a approximativement la position 1793 de la sequence d'ADN ayant la SEQ ID n° 1 . 

13. Vecteur hybride suivant la revendication ou 2, comprenant une sequence d'ADN qui s'hybride avec le segment 
75 d'insertion de L lactis EcoRI/Sall d'approximativement 3,5 kpb du plasmide pUCRS, depose sous le n° de depdt 

DSM 5803, ou avec un de ses fragments fonctionnels, ou bien une telle sequence d'ADN hybridante Nee de maniere 
fonctionnelle a sa region de promoteur naturelle. 

1 4. Vecteur hybride suivant la revendication 1 3, comprenant une sequence de gene d'ADN s'hybridant au gene codant 
20 pour la proteine secretoire majeure de L lactis ou a un de ses fragments fonctionnels. 

15. Vecteur hybride suivant la revendication 1, comprenant une sequence d'ADN qui code pour un peptide-signal 
suivant la revendication 1c). 

25 16. Vecteur hybride suivant la revendication 1 ou 2, comprenant un gene structural homologue ou heterologue, qui 
est lie de maniere fonctionnelle en cadre de lecture convenable a la sequence d'ADN codant pour le peptide-signal 
de la proteine secretoire majeure et un promoteur. 

17. Vecteur hybride suivant la revendication 16, codant pour la desulfatohirudine ou la proteine secretoire majeure de 
30 la souche L lactis LM0230 decrite dans la SEQ ID n° 1. 

18. Molecule d'ADN qui 

a) est le segment d'insertion de L lactis EcoRI/Sall d'approximativement 3,5 kpb du plasmide pUCRS, depose 
35 sous le n° de depot DSM 5803, ou un de ses fragments fonctionnels conservant les fonctions de promoteur, 

de signal ou des fonctions structurales, ou 

b) s'hybride avec ledit segment d'insertion ou avec un fragment fonctionnel de ce segment, ou bien est une 
telle sequence d'ADN hybridante liee de maniere fonctionnelle a sa region de promoteur naturelle, ou 

c) est une sequence degeneree d'une sequence d'ADN qui est incluse dans a) et qui code pour un peptide- 
40 signal. 

19. Molecule d'ADN hybride suivant la revendication 18, comprenant I'origine de replication d'un plasmide appartenant 
au meme groupe d'incompatibilite que le plasmide de 2,5 Md ou 5,2 Md de la souche L lactis LL712 deposee 
sous le n° de depot DSM 5804, origine qui peut etre obtenue a partir du fragment Ndel/Sphl d'approximativement 

45 1,8 kpb de longueur du plasmide de 2,5 Md de L lactis LL712 ou du fragment Ndel/Hindlll d'approximativement 

1,0 kpb de longueur du plasmide de 5,2 Md de L lactis LL712. 

20. H6te transforms avec un vecteur hybride suivant la revendication 1 ou 2. 

50 21 . Hote transforme suivant la revendication 20, dans lequel I'hote est choisi dans le groupe consistant en les souches 
E. coli TG 1 , C600 et HB101 ; des souches de L cremoris ; des souches de L lactis et des souches de Bacillus 
depourvues de plasmides. 

22. Hote transforms suivant la revendication 20, qui a ete transforms avec un vecteur hybride suivant la revendication 
55 17. 

23. ProcSdS pour la preparation d'un hote transforms suivant la revendication 20, comprenant les Stapes consistant 
(a) a traiter un hote dans des conditions de transformation avec un vecteur hybride suivant la revendication 1 ou 
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2, facultativement en association avec un gene marqueur de selection, et (b) a selectionner les transformants. 

24. Proc6d6 pour la transformation d'un polypeptide, comprenant les Stapes consistant (a) a fusionner un gene struc- 
tural homologue ou heterologue en cadre de lecture convenable avec la sequence d'ADN codant pour le peptide- 

5 signal suivant la revendication 9, (b) a transformer un note convenable avec un vecteur hybride comprenant un 

gene fusionne prepare en (a), (c) a provoquer la secretion du polypeptide code par (edit gene de fusion par les 
cellules notes et, lorsque cela est requis, (d) a isoler le polypeptide du surnageant par des procedes classiques. 

25. Precede suivant la revendication 24, dans lequel un note choisi dans le groupe d'hotes consistant en Lactococcus 
10 spec, et Bacillus spec, est utilise. 

26. Procede suivant la revendication 24, dans lequel est utilise un note bacterien qui ne secrete pas dans son milieu 
de culture une protease qui degrade la proteine exprimee. 

is 27. Procede suivant la revendication 24, dans lequel de la desulfatohirudine est produite. 

28. Procede suivant la revendication 27, dans lequel des cellules de la souche L lactis LM0230, deposee sous le n° 
de d6pot DSM 5805, sont utilisees. 

20 29. Procede suivant la revendication 24, caracterise en ce que (a) les cellules transformers sont collectees a partir 
du milieu nutritif en phase logarithmique ou en phase stationnaire de la culture, (b) elles sont remises en suspension 
dans un plus petit volume de milieu nutritif frais ou de solution tampon convenable, et (c) elles sont en outre mises 
en incubation avant d'isoler le produit desire du surnageant. 

25 30. Procede suivant la revendication 29, caracterise en ce que les cellules transformees sont remises en suspension 
dans un volume d'environ 1 % jusqu'a 20 % du volume de culture initial. 

31. Procede suivant la revendication 30, dans lequel les cellules transformees sont des cellules de la souche L lactis 
LM0230, deposee sous le n° de depot DSM 5805 transformees avec un vecteur hybride suivant la revendication 

30 17 codant pour la desulfatohirudine et sont remises en suspension dans approximativement 1/10 de volume de 

milieu nutritif frais et sont mises en incubation pendant un temps d'environ 30 minutes a une temperature d'environ 
30 °C. 

32. Procede suivant la revendication 24, dans lequel la proteine secretaire majeure de L. lactis est produite. 

35 

33. Proteine secretaire majeure de L. lactis sous forme pure, ayant la sequence d'amino-acides mentionnee dans la 
SEQIDnM. 

34. Procede pour la production de la proteine suivant la revendication 33, comprenant les etapes consistant (a) a 
40 precipiter les prolines a partir du surnageant de la souche L lactis LM0230, deposee sous le n° de depot DSM 

5805, avec de I'acide trichloracetique, (b) a soumettre a une electrophorese la proteine precipitee sur un gel de 
polyacrylamide au SDS, (c) a exciser la region portant la bande de proteine principale, et (d) a electroeluer la 
proteine hors du gel. 

45 
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Figure 1 
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Figure 2 
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Figure 3 



(BamHI) Hindlll (0) 
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